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tran j-feru late: Co ASH ligase activity (enzyme activity I), trans- 
feruloyl SCoA hydratase activity (enzyme activity II), and 4-hydroxy-3- 
methoxyphenyl-/?-hydroxy-propionyl SCoA (HMPHP SCoA) cleavage 
activity (enzyme activity HI). Conveniently the enzymes are provided 
by Pseudomonas ftuorescens Fe3 or a mutant or derivative thereof. 
Polypeptides with enzymes activities II and HI and polynucleotides 
encoding said polypeptides. Use of said polypeptides or said polynu- 
cleotides in a method for producing vanillin. 
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PRODUCTION OF VANILLIN 

The present invention relates principally to the production of vanillin (4- 
hydroxy-3-methoxy benzaldehyde) , particularly to the production of vanillin 
5 other than by extraction from the Vanilla pod. 

Vanillin is an important food and drink flavouring agent and a major 
flavour component of natural vanilla from the Vanilla pod. The use of 
natural vanilla is limited by its high price. Synthetic vanillin, commonly 

10 derived from sulphite liquors produced during the processing of wood pulp 
for paper manufacture, is frequently used as a low-cost vanilla substitute. 
Alternative biological processes for the production of natural vanillin and 
allied flavourings would have considerable industrial value and utility, 
most particularly if such processes could facilitate the production of 

15 vanillin and/or allied flavourings directly in a fermented food or beverage. 

The mechanism of vanillin biosynthesis in Vanilla remains substantially 
uncharacterised. M.H. Zenk (Anal. Z Pflanzenphysiol 53, 404-414 
(1965)) showed that vanillin was derived from fra/ts-ferulate (4-hydroxy-3- 

20 methoxy-mmy-cinnamate) and proposed a mechanism analogous to the 
classical 0-oxidation of fatty acids, with cleavage of a /3-keto thioester to 
produce acetyl SCoA and vanilloyl SCoA (4-hydroxy-3-methoxybenzoyl 
SCoA) and subsequent reduction and CoASH release to generate vanillin. 
C. Funk and P.E. Brodelius (Plant Physiol. 94, 95-101; 102-108 (1990); 

25 99, 256-262 (1992)), proposed a different route, in which the 4-hydroxy 
group of trans- ferulate became successively methylated and demethylated 
during the pathway of vanillin biosynthesis; however, the detailed 
enzymology was not elucidated. In potato tubers and in the fungus. 
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Polyporus hispidus (C.J. French, CP. Vance and G.H.N. Towers, 
Phytochemistry 15, 564-566 (1976)), in cell cultures of Lithospermum 
erythrorhizon (K. Yazaki, L. Heide and M. Tabata, Phytochemistry 30, 
2233-2236 (1991)) and in cell cultures of carrot (J.-P. Schnitzler, J. 

5 Madlung, A. Rose and H.U. Seitz, Planta 188, 594-600 (1992)), evidence 
was obtained from in vitro studies that the corresponding analogue of 
vanillin, 4-hydroxybenzaldehyde, was an intermediate in the formation of 
4-hydroxybenzoate from 4-coumarate (4-hydroxy-fra/is-cinnamate). There 
was no requirement for ATP or CoASH, thus apparently ruling out a /3- 

io oxidation mechanism. Further studies with cell-free extracts of 
Lithospermum erythrorhizon, however, have in contrast recently 
established the presence of a /3-oxidation route for the conversion of 4- 
coumarate to 4-hydroxybenzoate (R. Loscher and L. Heide, Plant Physiol 
106, 271-279 (1994)); in this case, the conversion was dependent on ATP, 

15 Mg 2+ ions and NAD + and proceeded via 4-hydroxybenzoyI SCoA, without 
the intermediate formation of 4-hydroxybenzaldehyde. 

In the Gram-negative bacterium, Pseudomonas acidovorans, rrans-ferulate 
was shown to be catabolised to vanillate and acetate, apparently via 

20 vanillin (A. Toms and J.M. Wood, Biochemistry 9, 337-343 (1970)). 
Although in cell-free extracts NAD* was necessary for the oxidation of 
vanillin to vanillate and for the further oxidation of vanillate to 
protocatechuate and formate, no mention was made of any other cofactor 
requirements. Further studies of ferulate utilisation in Pseudomonas 

25 species have been reported (V. Andreoni and G. Bestetti, FEMS 
Microbiology Ecology S3, 129-132 (1988); T. Omori, K. Hatakeyama and 
T. Kodama, AppL Microbiol. Biotechnol. 29, 497-500 (1988); Z. Huang, 
L. Dostal and J.P.N. Rosazza, AppL Env. Microbiol. 59, 2244-2250 
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(1993)); however, these have not sought to elucidate further the 
mechanism of the two-carbon cleavage of ferulate. Zenk et al (1980) 
Anal, Biochem. 101, 182-187 describe a procedure for the enzymatic 
synthesis and isolation of cinnamoyl-CoA thioesters using a bacterial 
5 system. In contrast, the enzymology and genetics of the utilisation of 
simple benzene derivatives, including benzoic acids and phenols, by 
Pseudomonas have been intensively studied (T.K. Kirk, T\ Higuchi and 
H.-M. Chang (eds.), Lignin biode gradation", CRC Press, Boca Raton, 
Fla, USA (1980); D.T. Gibson (ed.), Microbial degradation of organic 

10 compounds", Marcel Dekker, New York (1984); J.L. Ramos, A. 
Wasserfallen, K. Rose and K.N. Timmis, Science 235, 593-596 (1987); 
C.S. Harwood, N.N. Nichols, M.K. Kim, J.L. Ditty and R.E. Parales, 
7. BacterioL 176, 6479-6488 (1994); S. Romerosteiner, R.E. Parales, 
C.S. Harwood and J.E. Houghton, 7. BacterioL 176, 5771-5779 (1994); 

15 J. Inoue, J. P. Shaw, M. Rekik and S. Harayama, 7. BacterioL 177, 1196- 
1201 (1995)). 

A survey of potential microbial routes to aromatic aldehydes, including 
routes (i) from r/ww-cinnamic acids, (ii) from benzoic acids by reduction 
20 and (iii) by conversion of aromatic amino acids to phenylpyruvic acids 
followed by treatment with base, has been presented by J. Casey and R. 
Dobb (Enzyme Microb. TechnoL 14, 739-747 (1992)). 

US 5,128,253 describes a method of producing vanillin from fenilic acid 
25 by various microorganisms and extracts thereof or enzymes derived 
therefrom in the presence of a suiphydryl compound but does not disclose 
what any of the enzymes involved in the conversion of ferulic acid to 
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vanillin are. US 5,279,950 is a continuation-in-part application of US 
5,128 253 which additionally describes that Vanilla calluses can be used 
in the process. 

5 WO 94/13614 describes the production of vanillin from ferulic acid by the 
action of Vanilla root material and makes use of an adsorbent, such as 
charcoal, to extract vanillin but does not disclose the specific enzymes 
involved. 

10 EP 0 453 368 describes that a culture of Pycnoporus can convert 
/rans-ferulic acid into vanillin but does not disclose the specific enzymes 
involved. 

WO 94/02621 describes the production of vanillin from *ra/i5-feruiic acid 
15 by the action of a lipoxygenase enzyme. EP 0 405 197 describes the 
production of vanillin from eugenol/isoeugenol by bacteria from the 
genera Serratia, Klebsiella and Enterobacier by oxidation. 

Vanillin may also be produced from phenolic stilbenes as is mentioned in 
20 Hagedorn & Kaphammer (1994) Ann. Rev, Microbiol 48, 773-800. 

Vanillic acid is also a useful compound as it can be polymerised into 
oligomers or used as a monomer in the synthesis of polyesters; similarly 
p-hydroxybenzoic acid is also useful for polymer synthesis. 

25 

A first aspect of the invention provides a method of producing vanillin 
comprising the steps of 
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(1) providing rrans-ferulic acid or a salt thereof; and 

(2) providing trans- ferulate:CoASH ligase activity (enzyme activity I), 
rran5-feruloyl SCoA hydratase activity (enzyme activity II), and 4- 
hydroxy-3-methoxyphenyl-/3-hydroxy-propionyl SCoA (HMPHP 

5 SCoA) cleavage activity (enzyme activity III). 



The advantages of the present invention over chemical synthesis or 
extraction from the Vanilla pod include (i) economic advantage over 
extraction from Vanilla pod and freedom from geographical dependence 

io on Vanilla growing areas; (ii) the ability to produce vanillin by a natural 
process, involving biological catalysts; (iii) the benefits of generating a 
natural flavour in situ in a fermented food or beverage, if the genes are 
expressed in appropriate food-grade hosts - eg lactic acid bacteria or 
yeasts; and (iv) the possibility of expanding the range of plants in which 

15 vanillin and related substances might be produced and from which they 
might be extracted. These and other examples of the methods of the 
invention are described in more detail below. 



We have determined the mechanism of chain-shortening of rra/rs-ferulate 
20 (trans-femlic acid) by a strain of Pseudomonas fluorescens (named Ps. 
fluorescens biovar. V, strain AN103 and which we have abbreviated at 
some points to AN 103) isolated from soil. Our data indicate clearly that 
vanillin (4-hydroxy-3-methoxy benzaldehyde) is an intermediate and that 
the mechanism does not involve ^-oxidation. The vanillin pathway of Ps. 
25 fluorescens biovar. V, strain AN103 is described in Figure 1. 
7nms-ferulic acid (or a salt thereof) is interconverted with t rans- feruloyl 
SCoA in the presence of Co ASH; rrans-feruloyl SCoA is interconverted 
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with 4-hydroxy-3-methoxyphenyl-/^hydroxypropionyl SCoA (HMPHP 
SCoA); and HMPHP SCoA is interconverted with vanillin. 



For convenience, fra/w-feruIate:CoASH ligase activity is called enzyme 
5 activity I, rra/w-feruloyl SCoA hydratase activity is called enzyme activity 
II; and HMPHP SCoA cleavage activity is called enzyme activity III. The 
interconversions performed by these enzyme activities is shown in Figure 
1. 



10 The method of producing vanillin provided by the invention therefore 
includes the steps of exposing rra/zs-ferulic acid or a salt thereof to enzyme 
activity I and forming a product, exposing the said product of enzyme 
activity I to enzyme activity II to form a product and exposing the said 
product of enzyme activity II to enzyme activity III to form a product. 

15 

7>a/75-ferulic acid or a salt thereof may be provided directly, for example 
by supplying pre-prepared /rans-ferulic acid or a salt thereof, or it may be 
provided indirectly, for example by supplying a precursor of rrarw-ferulic 
acid or a precursor of a salt of franMeruiic acid and means to convert the 
20 said precursor into frans-ferulic acid or a salt thereof. As is described in 
more detail below, it is convenient if the precursor is an ester of trans- 
ferulic acid and the means to convert said ester is a suitable esterase. 



By "providing rm/i5-feruiate:CoASH ligase activity (enzyme activity I), 
25 rraw-feruloyl SCoA hydratase activity (enzyme activity II), and 4- 
hydroxy-3-methoxyphenyl-/3-hydroxy-propionyl SCoA (HMPHP SCoA) 
cleavage activity (enzyme activity III)^ we include the provision of the 
enzyme activities in any suitable form to effect the said production of 
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vanillin. For example, as is discussed in more detail below, the method 
of the invention specifically includes, but is not limited to, the provision 
of the enzyme activities (a) by intact or permeabilised Ps. fluorescens 
biovar. V, strain AN 103 or a mutant thereof, (b) at least one of enzyme 

5 activities II or III of which is in a form substantially free of cellular 
material, (c) by intact or permeabilised cells in culture, particularly 
microorganisms, which have been genetically modified to contain genes 
which encode enzyme activities II or III (for example, food grade 
microorganisms such as lactic acid bacteria and brewing yeast), and (d) by 

10 plants which have been genetically modified to contain genes which 
encode said enzyme activities. 



It is preferred if means for converting vanillin to a non-vanillin product 
is absent or reduced. Of course, the enzyme activity III is not such a 

15 means. Conveniently, these enzyme activities are provided by the soil 
bacterium Pseudomonas fluorescens biovar. V, strain AN 103 the said 
bacterium being that deposited under the Budapest Treaty at the National 
Collection of Industrial and Marine Bacteria Limited, AURIS Business 
Centre, 23 St. Machar Drive, Aberdeen AB2 1RY, Scotland under 

20 Accession No NCIMB 40783, or a mutant or variant thereof. By "mutani 
or variant thereof" we include any mutant or variant of the said bacterium 
provided that the bacterium retains the said enzyme activities whether or 
not at the same levels. It will be appreciated that the said enzyme 
activities can be retained even if the genes encoding said enzymes are 

25 mutated. For example, mutants which constitutively express (as opposed 
to conditionally or inducibly express) the said enzyme activities are 
particularly useful mutants of Ps. fluorescens biovar. V, strain AN 103, as 
are variants in which one or more of the enzymes with the said activities 
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exhibit more favourable kinetic characteristics (for example, an increased 
turnover number or a decreased K^). 

When Ps. fluorescens biovar. V, strain AN103 is growing on 
5 rrans-ferulate it will derive maximum benefit if vanillin is catabolised 
further in order to provide more energy. However, in order to maximise 
the production of vanillin by Ps. fluorescens biovar. V, strain AN 103 it 
is preferred that the means for converting vanillin to a non-vanillin 
product is absent or reduced. We have found that in Ps. fluorescens 
10 biovar. V, strain AN103 vanillin is converted to vanillic acid or a salt 
thereof by vanillin: NAD* oxidoreductase. It is preferred if a mutant of 
Ps. fluorescens biovar. V, strain AN 103 wherein the vanillin: NAD + 
oxidoreductase activity is absent or reduced is used in the method. Such 
a mutant can be made using a gene replacement strategy with a disrupted 
1 5 vanillin: NAD oxidoreductase gene, or a sequence of DN A from which this 
gene has been deleted. Gene replacement is well known in the art of 
bacterial genetics. Alternatively, isolation of such a mutant may be 
achieved by classical chemical mutagenesis, selecting on the basis of 
inability to grow on vanillin. 

20 

It will be appreciated that there are other means for converting vanillin to 
a non- vanillin product and it is preferred if these are absent or reduced in 
the method. 

25 Although Ps. fluorescens biovar. V, strain AN 103 or mutants or variants 
thereof themselves are useful in the method of the invention as whole cells 
or permeabilised or immobilised cells, it is preferred if the enzyme 
activities I, II and III are provided by an intact-cell-free system of Ps, 
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fluorescens biovar, V, stain AN103 or a mutant or variant thereof. 
Suitable systems and extracts may be used by methods well known in the 
art, for example by French pressure cell or sonication followed by 
centrifugation. Alternatively, whole cells may be permeabilised using 
5 methods well known in the art, for example using detergents such as 
dimethyl sulphoxide (DMSO). 

Using such an intact-cell-free system allows the necessary substrates and 
any cofactors to reach readily the relevant enzymes and for the products 
io to be released readily into the reaction medium if this is necessary for 
further reaction; however as discussed below, at least some of the 
enzymes of the invention may be involved in substrate (metabolic) 
channelling. 

15 We have found that none of the enzyme activities I, II and III from Ps. 
fluorescens biovar. V, strain AN 103 is dependent on NAD + whereas 
enzyme activity IV from Ps. fluorescens (vanil!in:NAD + oxido-reductase) 
requires NAD + . 

20 Thus, a preferred way of reducing means for converting vanillin to a 
non-vanillin product in an intact-cell-free system of Ps. fluorescens biovar. 
V, strain AN 103 (or in a cell-permeabilised system of Ps. fluorescens 
biovar. V, strain AN 103) is to omit NAD + from the reaction system. 
Any exogenous NAD* is readily and rapidly depleted by the presence of 

25 fra/is-feruiate in the system. 

For the microorganisms of the present invention which can be used in the 
method of vanillin production, including Ps. fluorescens biovar. V, strain 
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AN 103, at least three main types of bioreactor may be used for the 
biotransformation reactions: the batch tank, the packed bed and the 
continuous- flow stirred tank; their applications and characteristics have 
been reviewed (M.D. Lilly in "Recent Advances in Biotechnology", eds. 
5 F. Vardar-Sukan and S.S. Sukan, Kluwer Academic Publishers, 
Dordrecht, 1992, pp 47-68 and loc. cit.). 

As is described in more detail below, enzyme activities II and III are 
available free from other enzyme activities, for example directly or 
io indirectly from Ps, fluorescens biovar. V, strain AN103 and from other 
organisms or cells which have been genetically modified to express genes 
encoding the said enzyme activities. 

It will be appreciated that other microorganisms will be found which will 
15 be useful in the methods of the invention, for example, by screening. 
Such microorganisms and methods of screening and methods of use form 
part of the invention. The method of screening for other microorganisms 
possessing activities I, II and III is essentially that already described in the 
Materials and Methods section in the Examples for the isolation of AN 
20 103. The important aspect is isolation from an environment rich in trans- 
ferulate or related compounds (eg 4-rra/u-coumarate, mms-caffeate [3,4- 
dihydroxy-/ram-cinnamate} which, as described below, may also be 
substrates for enzyme activity I) and selection for growth on fra^-femlate 
(preferably) as sole carbon source. In practice, preferred sources are those 
25 in which plant-derived materials are being degraded; in addition to soil or 
compost, this would include the outflow or residues from factories or 
other installations processing such materials - eg sugar-beet factories, 
cocoa fermentation heaps etc - and the contents of the gastro-intestinal 
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tract, particularly of ruminants and other herbivores. It is possible that 
anaerobes might be found possessing these activities and due account can 
readily be taken of this in the isolation procedure. A priori, isolation of 
organisms with these activities might also be possible from marine 
5 environments. 



Genera in which further microorganisms useful in the invention will be 
found include Pseudomonas, Arthrobacter, Alcaligenes, Acinetobacter, 
Flavobacterium, Agrobacterium, Rhizobium, Streptomyces, 
10 Saccharomyces, Penicillium and Aspergillus. 



An alternative or additional approach is to use any one of the 
Pseudomonas genes encoding enzyme activities II or III described herein 
or redundant sequences designed from the Pseudomonas enzyme amino- 
15 acid sequences in DNA probes or PCR amplification strategies to find 
related genes in other organisms. As is made more clear below, enzymes 
and nucleotide sequences which are functionally equivalent to those of 
isolated from AN 103 but which differ in sequence form pan of the 
invention. 

20 

Our studies indicate that the enzyme which interconverts frans- ferulate and 
rra/is-feruloyl SCoA in Ps. fluorescens biovar. V, strain AN 103 uses 
Coenzyme A (CoASH), ATP and Mg 2+ or other functionally equivalent 
cofactors. Thus, it is preferred that the method further comprises the step 
25 of (3) providing any one of the cofactors Co ASH, ATP or Mg 2+ or other 
functionally equivalent cofactors. ATP is adenosine triphosphate. It is 
well known that other functionally equivalent cofactors can substitute in 
some cases for CoASH, ATP or Mg 2+ . For example Mn 2+ may be used 
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in place of Mg 2+ and derivatives or analogues of ATP, preferably with a 
hydroly sable 7-phosphate, may be used in place of ATP. 

We have also determined that, at least when the enzyme activity I is 
provided by the Pseudomonas AN 103 enzyme which interconverts 
rm/15-ferulate and rra/w-feruloyl SCoA and which enzyme uses ATP and 
Coenzyme ASH, it is convenient to include a system wherein either one, 
or both, of the cofactors Coenzyme ASH and ATP is recycled. The 
following ATP generation and Co ASH recycling systems are preferred. 

ATP generation: - 

(i) trans-Ferulate + CoASH + ATP + H 2 0 Vanillin + Acetyl 
SCoA + AMP + PPi (overall reaction catalysed by Ps. fluorescens 
biovar. V, strain AN 103 extract) 

(ii) AMP + ATP 5=k 2 ADP (adenylate kinase) 

(iii) Acetyl - P + ADP Acetate 4- ATP (acetate kinase) 

(iv) Sum; trans-Ferulate 4- CoASH + 2 Acetyl ~ P + H 2 0 -* 
Vanillin + Acetyl SCoA 4- 2 Acetate 4- PPi 

CoASH recycling is achievable using commercially-available citrate 
synthase (EC 4.1.3,7) and citrate lyase (EC 4.1.3.6), viz:- 

(v) Acetyl SCoA 4- Oxaloacetate 4- H 2 0 & Citrate 4- CoASH (citrate 
25 synthase) 

(vi) Citrate *=* Oxaloacetate 4- Acetate (citrate lyase) 

Overall sum, (iv)-(vi): 



5 



10 



15 



20 
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(vii) rra/w-Ferulate + 2 Acetyl ~ P + 2 H 2 0 -* Vanillin + 3 Acetate 
+ PPi 



The acetyl — P used in the overall process, (vii), would not itself be 
5 generated by enzymic means; however, none of its atoms would appear 
in the vanillin product. 

Acetyl phosphate is commercially available or can be synthesised using the 
method described by Stadtman (1957) Meth. EnzymoL 3, 228-231. 

10 

The reagents are commercially available from, for example Sigma 
Chemical Co, Fancy Road, Poole, Dorset, UK. Citrate lyase is typically 
from Enterobacter aerogenes\ citrate synthase is typically from chicken 
heart, pigeon breast muscle or porcine heart. 

15 

Thus, it is preferred if coenzyme ASH is recycled using the enzymes 
citrate synthase and citrate lyase; and it is preferred if the ATP is 
generated using the enzymes adenylate kinase (EC 2.7.4.3) and acetate 
kinase (EC 2.7.2.1). 

20 

The co-factor recycling system is particularly preferred when using an 
intact-cell-free system. 

Trans- ferulic acid or a salt thereof is readily available from plant material. 
25 Suitably, trans-femWc acid or a salt thereof is released from the plant 
material by the action of ferulic acid esterase. Thus, in a particularly 
preferred embodiment of the invention the /ra/w-ferulic acid or salt thereof 
is provided by the action of ferulic acid esterase on plant material. 
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Trans-ferulic acid and fnms-4-coumaric acid can together represent up to 
1.5% by weight of the cell walls of temperate grasses (R.D. Hartley and 
E.C. Jones, Phytochemistry 16, 1531-1534 (1977)). 7><my-ferulic acid is 
reported to comprise 0.5% (w/w) of wheat bran (M. C. Raiet, J.-F. 

5 Thibault and G. Delia Valle, 7. Cereal Sci. 11, 249-259 (1990)), 3. 1 % of 
maize bran (L. Saulnier, C. Marot, E. Chanliaud and J.-F. Thibault, 
Carbohydr. Polym. 26, 279-287 (1995)) and 0.8% of sugar beet pulp (V. 
Micard, G.M.G.C, Renard and J.-F. Thibault, Lebensm.-Wiss. u-TechnoL 
27, 59-66 (1994)). These materials are amongst the preferred sources of 

10 rra/is-ferulic acid. Since rra/z,s-ferulic acid is present esterified with cell- 
wall polysaccharides, hydrolysis is essential. Alkaline or acid hydrolysis 
is possible, but enzymic hydrolysis is preferred. Typically, the initial step 
is the partial enzymic hydrolysis of carbohydrates (arabinans, xylans, 
rhamnogalacturanans) to which mz/25-ferulate is linked, followed by the 

15 release of frans-ferulate from the oligosaccharide fragments by trans- 
ferulic acid esterase activity. In practice, both steps may occur 
simultaneously in the reaction mixture. Descriptions of representative 
laboratory-scale processes are available in the literature (for example see 
L.P. Christov and B.A. Prior, Enzyme Microb. Technol. 15, 460-475 

20 (1993)); C.B. Faulds and G. Williamson, AppL Microbiol. BiotechnoL 43, 
1082-1087 (1995); C.B. Faulds, P.A. Kroon, L. Saulnier, J.-F. Thibault 
and G. Williamson, Carbohydrate Polymers 27, 187-190 (1995)). 
Phenolic acid- releasing enzymes have been reported from a number of 
microorganisms, including Streptomyces olivochromogenes (C.B. Faulds 

25 and G. Williamson, 7. Gen, Microbiol. 137, 2337-2345 (1991)), 
Penicillium pinophilum (A. Castanares, S.I. McCrae and T.M. Wood, 
Enzyme Microb. TechnoL 14, 875-884(1992)), Neocallimastix spp . (W.S. 
Borneman, R.D. Hartley, W,H. Morrison, D.E. Akin and L.G. 



BNSDOC1D: <WO 9735999A2J_> 



WO 97/35999 PCT/GB97/00809 

15 

Ljungdahl , AppL Microbiol. Biotechnol. 33 , 345-35 1 ( 1 990)) , 
Schizophyllum commune (R.C. MacKenzie and D. Bilous, AppL Envir. 
Microbiol. 54, 1170-1173 (1988)) and Aspergillus spp. (M. Tenkanen, J. 
Schuseil, J. Puis and K. Poutanen, J. Biotechnol. 18, 69-84 (1991); C.B. 

5 Faulds and G. Williamson, Microbiology 140, 779-787 (1994)). A trans- 
ferulic acid esterase (XYLD) has been characterised from Pseudomonas 
fluorescens subsp. cellulosa, together with an arabinofuranosidase (XYLC) 
and an endoxylanase (XYLB; L.M.A. Ferreira, T.M. Wood, G. 
Williamson, C.B. Faulds, G.P. Hazlewood and H.J. Gilbert, Biochem. J. 

10 294, 349-355 (1993)). The genes for all three enzymes have been isolated 
(G.P. Hazlewood and H.J. Gilbert, in "Xylans and Xylanases", eds. J. 
Visser, G. Beldman, M.A. Kusters-van Someren and A.G.J. Voragen, 
Elsevier, Amsterdam, pp 259-273 (1992)). All of these references are 
incorporated herein by reference. 

15 

Thus, advantageously the trans- ferulic acid or a salt thereof may be 
provided by the action of trans-ferulic acid esterase on said ester. More 
particularly, it is advantageous to introduce a gene encoding said esterase 
into a host cell or organism which is being used in the methods of the 
20 invention. Thus, it is convenient to introduce a trans- ferulic acid esterase 
gene, such as the aforementioned XYLD gene, into a plant which is being 
used in the methods of the invention. 

Although, as described above, the method may be performed using 
25 enzyme activities I, II and III which are provided by Ps. fluorescens 
biovar. V, strain AN 103 or mutants or variants thereof themselves, or 
intact-cell-free extracts thereof, it is preferred if at least one of the enzyme 
activities II and III is provided by a substantially purified enzyme. 
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Substantially purified enzymes with enzyme activities II and III are 
described below. 

In a particularly preferred embodiment of the invention the method of the 
5 first aspect of the invention further comprises providing a compound, in 
addition to frans-ferulic acid or a salt thereof, which may be converted by 
any one of enzyme activities I, II or III into a desirable product. Suitably 
said compound is converted by any one or more of said enzyme activities 
into a product which is found in, and preferably contributes to the taste or 
10 aroma of, vanilla as extracted from Vanilla pod. 

Vanilla as extracted from Vanilla pod contains vanillin as a major 
component but also smaller quantities of desirable components such as p- 
hydroxybenzoic acid, p-hydroxybenzaldehyde and vanillic acid. Typically 
15 these components, and vanillin, are present as glucosides in green vanilla 
pods as well as in the free form. However, upon hydrolysis or 
fermentation of the green pods or of hydrolysis of the fermented pods, 
most of the components are present in the free form. 

20 Thus, it is particularly preferred if said compound is any one of trans-A- 
coumaric acid or a salt thereof, frans-4-coumaroyl SCoA, rra/75-caffeic 
acid or a salt thereof, frans-caffeoyl SCoA, or 3,4-methylenedioxy-/ra;?s- 
cinnamic acid or a salt thereof. By the action of one or more of enzyme 
activities I, II or III rrans-4-coumaric acid or a salt thereof and trans-A- 

25 coumaroyl SCoA are converted to p- hydro xybenzaldehyde; //vz/w-caffeic 
acid or a salt thereof and /ra/is-caffeoyl SCoA are converted to 3,4- 
dihydroxybenzaldehyde; and 3,4-methylenedioxy-rrarzs-cinnaniic acid or 
a salt thereof is converted to heliotropin. 
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It is preferred if the compound is franj-4-coumaric acid or a salt thereof 
or mmy-4-coumaroyl SCoA and that the desirable product is 4- 
hydroxybenzaldehyde which is a significant component of natural Vanilla 
extract. 

5 

The enzyme activities I, II and III from Ps. fluorescens biovar V, strain 
AN 103 are able to use mmy-caffeate and rran5-4-coumarate, (and, as 
appropriate, the products of their reaction with enzyme activity I) with 
reasonable efficiency whereas cinnamate and 3,4-methylenedioxy-/rans- 
10 cinnamate, although may be used as substrates, are poor substrates of the 
AN 103 enzymes. 

Thus, the method of the first aspect of the invention is suited to make 
vanilla flavourings and aromas which more closely resemble the vanilla 
15 from Vanilla pod. 

The method of the first aspect of the invention may, in certain 
circumstances, also be performed using the host cells and genetically 
modified cells and organisms as described below in more detail. 

20 

A second aspect of the invention provides a method of producing vanillic 
acid, or a salt thereof, comprising the steps of 

(1) providing /rans-ferulic acid or a salt thereof; 
25 (2) providing rra/75-ferulate: Co ASH ligase activity, /ra/w-feruloyl SCoA 
hydratase activity, and 4-hydroxy-3-methoxyphenyl-jS-hydroxy- 
propionyl SCoA (HMPHP SCoA) cleavage activity; and 
(3) providing an activity that interconverts vanillin and vanillic acid. 
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For convenience, the activity that interconverts vanillin and vanillic acid 
is called enzyme activity IV. Conveniently the activity is provided by 
vanillin: NAD + oxidoreductase (vanillin dehydrogenase). Suitably, this 
activity is provided by Ps. fluorescens biovar. V, strain AN 103. Methods 

5 of converting vanillin to vanillic acid or a salt thereof are also known in 
the art, for example Perestelo et al (1989) App. Environ. Microbiol. 55, 
1660-1662 describes the production of vanillic acid from vanillin by 
resting cells of Serratia marcescens and Pomelto & Crawford (1983) App. 
Environ. Microbiol. 45, 1582-1585 describe whole-cell bioconversion of 

io vanillin to vanillic acid by Streptomyces viridosporus . 

The method of producing vanillic acid provided by the invention therefore 
includes the steps of exposing //ww-ferulic acid or a salt thereof to enzyme 
activity I and forming a product, exposing the said product of enzyme 
15 activity I to enzyme activity II to form a product, exposing the said 
product of enzyme activity II to enzyme activity III to form a product, and 
exposing the said product of enzyme activity III to enzyme activity IV to 
form a product. 

20 It will be appreciated that vanillic acid can be made by the same means as 
vanillin is made in the method of the first aspect of the invention 
provided, of course, that enzyme activity IV is supplied. 

A further preferred embodiment of the first aspect of the invention 
25 comprises the further step of separating vanillin from the other reaction 
components. 
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Vanillin, and other aromatic aldehydes, are, for example, recoverable by 
extraction with solvent, including supercritical carbon dioxide, and by 
organophilic pervaporation, using membranes constructed of hydrophobic 
polymers (G. Bengston and K.W. Bodekker, in "Bioflavour 95", eds. P. 

5 Etievant and P. Schreier, INRA, Paris, pp 393-403 (1995); S.M. Zhang 
and E. Drioli, Separ. Sci. TechnoL 8, 1-31 (1994)); pervaporation 
technology has been applied, for example, to the recovery of flavour 
compounds of wine (N. Rajagopalan and M. Cheryan, J. Membrane Sci. 
104, 243-250 (1995)). Solid-phase extraction, followed by desorption 

10 with solvent, is also possible, though less preferred. 

However, in some circumstances, particularly where minor reaction 
products are present which are similar to compounds present in the vanilla 
isolated from Vanilla pod, vanillin is not isolated. 

15 

A further preferred embodiment of the second aspect of the invention 
comprises the further step of separating vanillic acid or a salt thereof from 
the other reaction components. 

20 Vanillic acid and other carboxylic acids may, for example, be recovered 
by solid-phase extraction, by solvent extraction under acidic conditions, 
or by pertraction; for example, L. Boyadzhiev and I. Atanassova (Process 
Biochemistry 29, 237-243 (1994)) describe the recovery of the aromatic 
amino acid, phenylalanine, by pertraction. 

25 

A third aspect of the invention provides Pseudomonas fluorescens biovar. 
V, strain AN 103 as deposited under the Budapest Treaty at the National 
Collections of Industrial and Marine Bacteria Limited, AURIS Business 
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Centre, 23 St Machar Drive, Aberdeen AB2 1RY, Scotland under 
Accession No NCIMB 40783, or a mutant or variant thereof. Preferred 
mutants and variants are the same as those preferred in the first aspect of 
the invention. A particularly preferred mutant of Ps. fluorescens biovar. 

5 V, strain AN 103 is one which accumulates vanillin when provided with 
rrans-ferulic acid or a salt thereof. Conveniently, this is a mutant of Ps. 
fluorescens biovar. V, strain AN 103 wherein vanillin: NAD* 
oxidoreductase activity is absent or reduced. Suitably, there is a mutation 
in the gene encoding vanillin: NAD + oxidoreductase such that the enzyme 

10 activity is absent or substantially reduced. Such a mutant can be made 
as described above. 

A fourth aspect of the invention provides a polypeptide which, in the 
presence of appropriate cofactors if any, is capable of catalysing the 

15 interconversion of /r<2/2s-feruloyl SCoA and 4-hydroxy-3-methoxy- 
phenyl-jS-hydroxypropionyl SCoA (HMPHP SCoA). Such a polypeptide 
has enzyme activity II. Conveniently, the polypeptide comprises 
/ra/ts-feruloyl SCoA hydratase; more conveniently the polypeptide 
comprises /ra/is-feruloyl SCoA hydratase from Ps. fluorescens biovar. V, 

20 strain AN 103 or fragments or variants thereof which have at least 1 % of 
the specific activity of the native enzyme (in relation to fra/15-feruloyl 
SCoA hydratase activity), preferably at least 10%, more preferably at least 
100%. 

25 The enzyme activity is readily purified as described in the Examples. 
Modifications to this procedure may be readily made by the person skilled 
in the art so that a polypeptide with enzyme activity II can be obtained 
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from any suitable source making use of the enzyme activity II assay 
procedure described in the Examples. 

It is preferred if the polypeptide of the fourth aspect of the invention 
5 comprises the amino acid sequence. 

MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 
PheVallleLeuAsnArgProGluLysArgAsnAlaMetSerProThrLeuAsnArgGlu 
MetlleAspValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuThr 

10 GlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheArgGluValAsp 
AlaGlyProGluIleLeuGlnGluLysIleArgArgGluAlaSerGlnTrpGlnTrpIiys 
LeuLeuArgMetTyrAlaLysProThrlleAlaMetValAsnGlyTrpCysPheGlyGly 
GlyPheSerProLeuValAlaCysAspLeuAlalleCysAlaAspGluAlaThrPheGly 
LeuSerGluIleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAsp 

15 ThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrPheGlyGlyGln 
LysAlaAlaGluMetGlyLeuValAsnGluSerVaLProLeuAlaGlnLeuArgGluVal 
ThrlleGluLeuAlaArgAsnLeuLeuGluLysAsnProValValLeuArgAlaAlaLys 
Hi sGlyPheLysArgCysArgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAla 
LysLeuAspGlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMetLysGln 

20 PheLeuAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArg { SEQ ID No 2) , 

or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
25 nucleotides 2872 to 3699. 

A fifth aspect of the invention provides a polypeptide which, in the 
presence of appropriate cofactors if any, is capable of catalysing the 
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interconversion of 4-hydroxy-3-methoxyphenyl-/3-hydroxy-propionyl 
SCoA (HMPHP SCoA) and vanillin. Such a polypeptide has enzyme 
activity III. Conveniently, the polypeptide comprises HMPHP SCoA 
cleavage enzyme; more conveniently the polypeptide comprises HMPHP 
5 SCoA cleavage enzyme from Ps. fluorescens biovar. V, strain AN 103 or 
fragments or variants thereof which have at least 1% of the specific 
activity of the native enzyme (in relation to HMPHP SCoA cleavage 
activity), preferably at least 10%, more preferably at least 100%. 



io The enzyme activity is readily purified as described in the Examples. 
Modifications to this procedure may be readily made by the person skilled 
in the art so that a polypeptide with enzyme activity III can be obtained 
from any suitable source making use of the enzyme activity III assay 
procedure described in the Examples. 

15 

It is preferred if the polypeptide of the fifth aspect of the invention 
comprises the amino acid sequence 

MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 
20 PheVallleLeuAsnArgProGluLysArgAsnAlaMetSerProThrLeuAsnArgGlu 
MetlleAspVaLLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuThr 
GlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheArgGluValAsp 
AlaGlyProGluIleLeuGlnGluLysIleArgArgGluAlaSerGlnTrpGlnTrpLys 
LeuLeuArgMetTyrAlaLysProThrlleAlaMetValAsnGlyTrpCysPheGlyGly 
25 GlyPheSerProLeuValAlaCysAspLeuAlalleCysAlaAspGluAlaThrPheGly 
LeuSerGluIleAsnTrpGly IleProProGlyAsnLeuValSerLysAlaMetAlaAsp 
ThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrPheGlyGlyGln 
LysAlaAlaGluMetGlyLeuValAsnGluSerValProLeuAlaGlnLeuArgGluVal 
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ThrlleGluLeuAlaArgAsnLeuLeuGluLysAsnProVal ValLeuArgAlaAlaLys 
Hi sGlyPheLysArgCysArgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAla 
LysLeuAspGlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMet LysGln 
PheLeuAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArg (SEQ ID No 2) , 



or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
nucleotides 2872 to 3699. 

A sixth aspect of the invention provides a polypeptide comprising the 
amino acid sequence 



10 



MetLetoAspValProLeuLeuIleGlyGlyGlnSerCysProAlaArgAspGlyArgThr 
15 PheGluArgArgAsnProValThrGlyGluLeuValSerArgValAlaAlaAlaThrLeu 
GluAspAlaAspAlaAlaValAlaAlaAlaGlnGlnAlaPheProAlaTrpAlaAlaLeu 
AlaProAsnGluArgArgSerArgLeuLeuLysAlaAlaGluGlnLeuGlnAlaArgSer 
GlyGluPhelleGluAlaAlaGlyGluThrGlyAlaMetAlaAsnTrpTyrGlyPheAsn 
ValArgLeuAlaAlaAsnMetLeuArgGluAlaAlaSerMetThrThrGlnValAsnGly 
20 GluVallleProSerAspValProGlySerPheAlaMetAlaLeuArgGlnProCysGly 
ValValLeuGlylleAlaProTrpAsnAlaProVallleLeuAlaThrArgAlalleAla 
MetProLeuAlaCysGlyAsnThrValValLeuLysAlaSerGluLeuSerProAlaVal 
HisArgLeuIleGlyGlnValLeuGlnAspAlaGlyLeuGlyAspGlyValValAsnVal 
rieSerAsnAlaProAlaAspAlaAlaGlnlleValGluArgLeuIleAlaAsnProAla 
25 ValArgArgValAsnPheThrGlySerThrHisValGlyArglleValGlyGluLeuSer 
AlaArgHisLeuLysProAlaLeuLeuGluLeuGlyGlyLysAlaProLeuLeuValLeu 
AspAspAlaAspLeuGluAlaAlaValGlnAlaAlaAlaPheGlyAlaTyrPheAsnGln 
GlyGlnlleCysMetSerThrGluArgLeuIleValAspAlaLysValAlaAspAlaPhe 
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ValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspProAlaAspProGlu 
SerValLeuGlySerLeuValAspAlaSerAlaGlyThrArglleLysAlaLeuIleAsp 
AspAlaValAlaLysGlyAlaArgLeuVallleGlyGlyGlnLeuGluGlySerlleLeu 
GlnProThrLeuLeuAspGlyValAspAlaSerMetArgLeuTyrArgGluGluSerPhe 
5 GlyProValAlaValValLeuArgGlyGluGlyGluGluAlaLeuL.euGlnLeuAlaAsn 
AspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGlyArgAlaLeuAla 
LeviAlaGlnArgValGluSerGlylleCysHisIleAsnGlyProThrValHisAspGlu 
AlaGlnMetProPheGlyGlyValLysSerSerGlyTyrGlySerPheGlyGlyLysAla 
SerlleGluHisPheThrGlnLeuArgTrpValThrLeuGlnAsnGlyProArgHisTyr 
10 Prolle 

(SEQ ID No 4) or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
15 nucleotides 3804 to 5249. 

As described in detail in the examples, this polypeptide sequence encodes 
an enzyme with vanillin: NAD * oxidoreductase activity from Ps. 
fluorescens biovar V., strain AN103. 

20 

By tk variants" we include deletions, insertions and substitutions either 
conservative or non-conservative, where such changes may reduce or 
enhance the activity, or may not substantially alter the activity. In 
particular, the seventh aspect of the invention includes the complete 
25 polypeptide sequence of Ps. fluorescens biovar V, strain AN 103 
vanillin: NAD + oxidoreductase and this polypeptide itself. 
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By "conservative substitutions" is intended combinations such as Gly, Ala; 
Val, He, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 
Such variants may be made using the methods of protein engineering and 
site-directed mutagenesis as described below and as is well known in the 
5 art. 

A preferred embodiment of the invention is a polypeptide as defined in the 
fourth, fifth or sixth of the invention which is substantially pure. 

10 By "substantially pure" we mean that the polypeptide is substantially free 
of other polypeptides, or other macromolecules, with which it is usually 
found in nature. Suitably, the polypeptide is substantially free of any 
other polypeptides or macromolecules. It is preferred if the polypeptide 
has less than 50% by weight of any other polypeptide, preferably less than 

15 10%, more preferably less than 1 %, still more preferably less than 0.1 % 
and most preferably less than 0.01%. 

Polypeptides can be purified using methods known in the art. It is 
preferred if the polypeptide is the product of a recombinant DNA. 

20 

A single polypeptide chain may comprise more than one of the enzyme 
activities (II) trans- feruloyl SCoA hydratase activity or (III) HMPHP 
SCoA cleavage activity. 

25 Our data in the Examples shows that, in the case of Ps. fluorescens 
biovar. V, strain AN 103, enzyme activities II and III are found in the 
same polypeptide chain, the sequence of which is given as the preferred 
polypeptides of the fifth and sixth aspects of the invention. Thus, when 
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enzyme activities II and III are provided in any aspect of the invention it 
is most convenient if they are provided in the same polypeptide chain. 

It will be appreciated that, using protein engineering methods or chemical 
5 cross-linking it may be possible to produce a single molecule which has 
enzyme activities II and III. Such a molecule, therefore, forms a further 
aspect of the invention. 

An seventh aspect of the invention provides a polynucleotide encoding a 
10 polypeptide as defined in any one of the fourth, fifth or sixth aspects of 
the invention. 

By "polynucleotide" we include RNA and DNA. DNA is preferred. 

15 Thus, this aspect of the invention provides a polynucleotide which encodes 
any one of a polypeptide which has (II) fra/is-feruloyl SCoA hydratase 
activity; (III) HMPHP SCoA cleavage activity; (IV) vanillin: NAD + 
oxidoreductase activity ; or a polypeptide which has more than one of these 
activities. Preferably the polynucleotide is derived from Ps. fluorescens 

20 biovar. V, strain AN 103. A preferred polynucleotide comprises all or ai 
least a part of the Ps. fluorescens DNA contained within the cosmid clone 
pFI793 as deposited under the Budapest Treaty at the National Collections 
of Industrial and Marine Bacteria Limited, AURIS Business Centre, 23 St 
Machar Drive, Aberdeen AB2 1RY, Scotland under Accession No 

25 NCIMB 40777, or a fragment or variant thereof. 

The isolation of the cosmid clone pFI 793 is described in Example 5; pFI 
793 includes DNA which encodes polypeptides which have enzyme 
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activities II, III and IV. The cosmic! clone pFI 793 itself, the genes 
contained in the Ps. fluorescens DNA thereof, and variants thereof form 
separate aspects of the invention. 

5 A variant of a polynucleotide includes any insertion, deletion or 
substitution of the sequence which encodes a fragment or variant of a 
polypeptide as defined above. 

For example, site-directed mutagenesis or other techniques can be 
10 employed to create single or multiple mutations, such as replacements, 
insertions, deletions, and transpositions, as described in Botstein and 
Shortle, Strategies and Applications of In Vitro Mutagenesis, Science, 229: 
193-210 (1985), which is incorporated herein by reference. Since such 
modified polynucleotides can be obtained by the application of known 
15 techniques to the teachings contained herein, such modified 
polynucleotides are within the scope of the claimed invention. 

Moreover, it will be recognised by those skilled in the art that the 
polynucleotide sequence (or fragments thereof) of the invention can be 
20 used to obtain other DNA sequences that hybridise with it under 
conditions of high stringency. Such DNA includes any genomic DNA. 

Accordingly, the polynucleotide of the invention includes DNA that shows 
at least 55 per cent, preferably 60 per cent, and most preferably 70 per 
25 cent homology with the polynucleotide sequences identified in the 
invention, provided that such homologous DNA encodes a protein which 
is usable in the methods described herein. 
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DNA-DNA, DNA-RNA and RNA-RNA hybridisation may be performed 
in aqueous solution containing between 0.1XSSC and 6XSSC and at 
temperatures of between 55 °C and 70°C, It is well known in the art that 
the higher the temperature or the lower the SSC concentration the more 
5 stringent the hybridisation conditions. By high stringency we mean 
2XSSC and 65°C. 1XSSC is 0.15M NaCl/0.015M sodium cirrate. 



w Variants" of the polynucleotide include polynucleotides in which 
relatively short stretches (for example 20 to 50 nucleotides) have a high 
10 degree of homology (at least 50% and preferably at least 90 or 95%) with 
equivalent stretches of the polynucleotide of the invention even though the 
overall homology between the two polynucleotides may be much less. 
This is because important active or binding sites may be shared even when 
the general architecture of the protein is different. 

15 

A particularly preferred polynucleotide comprises the nucleotide sequence 

ATGAGCACATACGAAGGTCGCTGGAAAACGGTCAAGGTCGAAATCGAAGACGGCATCGCG 
TTTGTCATCCTCAATCGCCCGGAAAAACGCAACGCGATGAGCCCGACCCTGAACCGCGAG 

20 ATGATCGATGTTCTGGAAACCCTCGAGCAGGACCCTGCCGCCGGTGTGCTGGTGCTGACC 
GGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCCGCGAAGTGGAC 
GCCGGCCCGGAAATCCTCCAGGAAAAAATCCGCCGCGAAGCCTCGCAATGGCAATGGAAA 
CTGCTGCGCATGTACGCCAAGCCGACCATCGCCATGGTCAATGGCTGGTGCTTCGGCGGC 
GGTTTCAGCCCGCTGGTGGCCTGCGACCTGGCGATCTGCGCCGACGAAGCAACCTTCGGT 

25 CTCTCGGAAATCAACTGGGGTATCCCGCCGGGCAACCTGGTGAGCAAGGCCATGGCCGAC 
ACCGTGGGCCACCGCCAGTCGCTCTACTACATCATGACCGGCAAGACCTTCGGTGGGCAG 
AAAGCCGCCGAGATGGGCCTGGTCAACGAAAGCGTGCCCCTGGCGCAACTGCGCGAAGTC 
ACCATCGAGCTGGCGCGTAACCTGCTCGAAAAAAACCCGGTGGTGCTGCGTGCCGCCAAA 
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CACGGTTTCAAACGCTGCCGCGAACTGACCTGGGAGCAGAACGAGGATTACCTGTACGCC 
AAGCTCGATCAGTCGCGTTTGCTGGACACCGAAGGCGGTCGCGAGCAGGGCATGAAGCAATTCC 
TCGACGACAAGAGCATCAAGCCTGGCCTGCAAGCGTATAAACGC (SEQ ID No 1) 

5 (as given in Figure 12, nucleotides 2872 to 5249) or a fragment or variant 
thereof. 



This polynucleotide encodes HMPHP SCoA cleavage enzyme activity and 
a rra/is- feruloyl SCoA hydratase activity from Ps. fluorescens biovar. V, 
10 strain AN103 and encodes the preferred polypeptide of the fifth and sixth 
aspects of the invention. 

A further particularly preferred polynucleotide comprises the nucleotide 
sequence 

15 

ATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGTCGTGCCCCGCGCGCGACGGTCGAACC 
TTCGAGCGCCGCAACCCGGTGACTGGCGAGTTGGTGTCGCGGGTTGCCGCCGCCACCCTG 
GAAGATGCCGACGCCGCCGTGGCCGCTGCCCAGCAAGCGTTTCCCGCGTGGGCCGCGCTG 
GCGCCCAATGAACGGCGCAGCCGTTTGCTCAAGGCCGCCGAACAATTGCAGGCGCGCAGC 

20 GGCGAGTTCATCGAGGCGGCGGGCGAGACCGGCGCCATGGCCAACTGGTACGGGTTCAAC 
GTACGGCTGGCGGCCAACATGCTGCGTGAAGCGGCATCGATGACCACCCAGGTCAATGGT 
GAAGTGATTCCCTCGGACGTTCCCGGCAGTTTCGCCATGGCCCTGCGCCAGCCCTGTGGC 
GTGGTGCTGGGCATCGCCCCCTGGAACGCCCCGGTGATTCTCGCCACCCGGGCGATTGCC 
ATGCCGCTGGCCTGTGGCAACACCGTGGTGCTGAAGGCTTCCGAGCTGAGTCCGGCGGTG 

25 CATCGCTTGATCGGCCAGGTGCTGCAGGACGCCGGCCTGGGCGATGGCGTGGTCAACGTC 
ATCAGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGATTGCCAACCCGGCC 
GTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCGGGCGCATTGTCGGCGAGCTCTCG 
GCGCGCCACCTCAAACCGGCGTTGCTCGAGCTGGGCGGCAAGGCACCGTTGCTGGTGCTC 
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GACGATGCCGACCTGGAGGCTGCCGTGCAGGCGGCGGCGTTTGGCGCCTACTTCAACCAG 
GGACAGATCTGTATGTCCACCGAGCGCCTGATTGTCGATGCCAAGGTGGCCGACGCCTTT 
GTCGCCCAGTTGGCGGCCAAGGTCGAGACCCTGCGCGCCGGTGATCCTGCCGACCCGGAG 
TCGGTGCTCGGTTCGCTGGTGGACGCCAGCGCTGGCACGCGGATCAAAGCGTTGATCGAT 
5 GATGCCGTGGCCAAGGGCGCGCGCCTGGTAATCGGCGGGCAACTGGAGGGCAGCATCTTG 
CAGCCGACCCTGCTCGACGGTGTCGACGCGAGCATGCGTTTGTACCGCGAAGAGTCCTTC 
GGCCCGGTGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCTGTTGCAACTGGCCAAC 
GACTCCGAGTTCGGTTTGTCGGCGGCGATTTTCAGTCGTGACACCGGCCGTGCCCTGGCC 
CTGGCCCAGCGGGTCGAATCGGGCATCTGCCACATCAACGGCCCGACCGTGCACGACGAA 
10 GCGCAAATGCCTTTTGGCGGGGTCAAGTCCAGCGGCTACGGCAGTTTTGGCGGCAAGGCA 
TCGATTGAGCATTTCACTCAGTTGCGCTGGGTCACCCTCCAGAATGGTCCACGGCACTAT 
CCGATC 

(SEQ ID No 3) (as given in Figure 12, nucleotides 3804 to 5249) 

15 

or a fragment or variant thereof. This polynucleotide encodes the 
polypeptide sequence of the sixth aspect of the invention. It is particularly 
convenient to isolate the whole gene from cosmid clone pFI 793 as 
deposited under the Budapest Treaty at NCIMB under Accession No 
20 NCIMB 40777. 

The polynucleotides of the invention are all readily isolated from pFI 793 
by probing with the given sequences or parts thereof, or by other methods 
known in the art, and the nucleotide sequences can be confirmed by 
25 reference to the deposited cosmid (pFI 793). 

It will be appreciated that fragments and variants of the polynucleotides 
of the invention can readily be made by the person skilled in the art using 
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standard molecular biological methods such as those described in 
Sambrook et al "Molecular Cloning, a laboratory manual", (1989), (2nd 
Edition), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York. The whole gene and variants and fragments thereof are specifically 
5 included in this aspect of the invention. 

It is preferred if the polynucleotide, conveniently DNA, is joined to a 
nucleic acid vector. 

io DNA constructs of the invention may be purified from the host cell using 
well known methods. 

For example, plasmid vector DNA can be prepared on a large scale from 
cleaved ly sates by banding in a CsCl gradient according to the methods of 
15 Clewell & Helinski (1970) Biochemistry 9, 4428-4440 and Clewell (1972) 
J. Bacteriol, 110, 667-676. Plasmid DNA extracted in this way can be 
freed from CsCl by dialysis against sterile, pyrogen-free buffer through 
Visking tubing or by size-exclusion chromatography. 

20 Alternatively, plasmid DNA may be purified from cleared lysates using 
ion-exchange chromatography, for example those supplied by Qiagen 
(Chatsworth, CA, USA). Hydroxyapatite column chromatography may 
also be used. 

25 The DNA is then expressed in a suitable host to produce a polypeptide of 
the invention. Thus, a DNA encoding a polypeptide of the invention may 
be used in accordance with known techniques, appropriately modified in 
view of the teachings contained herein, to construct an expression vector, 
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which is then used to transform an appropriate host cell for the expression 
and production of the polypeptide of the invention. Such techniques 
include those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 
to Rutter et al, 4, 530,901 issued 23 July 1985 to Weissman, 4,582,800 

5 issued 15 April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark 
et al y 4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 
November 1987 to Itakura et al, 4,710,463 issued 1 December 1987 to 
Murray, 4,757,006 issued 12 July 1988 to Toole, Jr. et al y 4,766,075 
issued 23 August 1988 to Goeddel et aU and 4,810,648 issued 7 March 

10 1989 to Stalker, all of which are incorporated herein by reference. 

The DNA encoding the polypeptide constituting the compound of the 
invention may be joined to a wide variety of other DNA sequences for 
introduction into an appropriate host. The companion DNA will depend 
15 upon the nature of the host, the manner of the introduction of the DNA 
into the host, and whether episomal maintenance or integration is desired. 

Generally, the DNA is inserted into an expression vector, such as a 
plasmid, in proper orientation and correct reading frame for expression. 

20 If necessary, the DNA may be linked to the appropriate transcriptional and 
translational regulatory control nucleotide sequences recognised by the 
desired host, although such controls are generally available in the 
expression vector. The vector is then introduced to the host through 
standard techniques. Generally, not all of the hos^ will be transformed 

25 by the vector. Therefore, it will be necessary to select for transformed 
host cells. One selection technique involves incorporating into the 
expression vector a DNA sequence, with any necessary control elements, 
that codes for a selectable trait in the transformed cell, such as antibiotic 
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resistance. Alternatively, the gene for such selectable trait can be on 
another vector, which is used to co-transform the desired host cell. 

Host cells that have been transformed by the recombinant DNA of the 
5 invention may then be cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
disclosed herein to permit the expression of the polypeptide, which, if 
desirable, can then be recovered. 

10 Many expression systems are known, including bacteria (for example 
Escherichia coli and Bacillus subtilis), yeasts (for example Saccharomyces 
cerevisiae), filamentous fungi (for example Aspergillus), plant cells and 
whole plants, animal cells and insect cells. 

15 The vectors include a prokaryotic replicon, such as the CoIEl or/, for 
propagation in a prokaryote, even if the vector is to be used for expression 
in other, non-prokaryotic, cell types. The vectors can also include an 
appropriate promoter such as a prokaryotic promoter capable of directing 
the expression (transcription and translation) of the genes in a bacterial 

20 host cell, such as E. coli, transformed therewith. 

Several promoters are available to direct transcription of bacterial and 
other heterologous genes in plants. In particular, these include the 35S 
promoter of cauliflower mosaic virus (CaMV 35S), the ribulose 
25 bisphosphate carboxylase small subunit promoter and the Agrobacterium 
T-DNA octopine synthase and manopine synthase promoters. These 
promoters have been widely used, for example, in conjunction with 
bacterial genes conferring herbicide resistance (see D.M. Stalker, ibid.. 
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pp 82-104). These promoters do not confer any specificity of gene 
expression at the organ, tissue or organeilar levels, or responsiveness of 
gene expression to environmental influences such as light. 

5 A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are 
typically provided in plasmid vectors containing convenient restriction sites 
for insertion of a DNA segment of the present invention. 

10 

Typical prokaryotic vector piasmids are pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories, (Richmond, CA, USA) and 
p7rc99A and pKK223-3 available from Pharmacia, Piscataway, NJ, USA. 

15 A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, NJ, USA. This vector uses the SV40 late 
promoter to drive expression of cloned genes, the highest level of 
expression being found in T antigen-producing cells, such as COS-1 cells. 

20 An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour virus long terminal repeat to 
drive expression of the cloned gene. 

25 Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Piasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating piasmids (Yips) and incorporate the yeast selectable markers 
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HIS3, TRP1, LEVI and URA3. Plasraids pRS413-416 are Yeast 
Centromere plasmids (YCps) 



A variety of methods have been developed to operably link DNA to 
5 vectors via complementary cohesive termini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. 

10 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
is polymerase I, enzymes that remove protruding, 3 '-single-stranded termini 
with their 3 f -5'-exonucleolytic activities, and fill in recessed 3'-ends with 
their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA 
20 segments. The blunt-ended segments are then incubated with a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the reaction are 
DNA segments carrying polymeric linker sequences at their ends. These 
25 DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 
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Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

5 A desirable way to modify the DNA encoding the polypeptide of the 
invention is to use the polymerase chain reaction as disclosed by Saiki ei 
al (1988) Science 239, 487-491. 

In this method the DNA to be enzymatically amplified is flanked by two 
10 specific oligonucleotide primers which themselves become incorporated 
into the amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into 
expression vectors using methods known in the art. 

15 The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
be either prokaryotic or eukaryotic and it may be comprised in a 
multicellular organism such as a plant. Bacterial cells are preferred 
prokaryotic host cells and typically are -a strain of E. coli such as, for 

20 example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 
American Type Culture Collection (ATCC) of Rockville, MD, USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and plant 
cells. Yeast host cells include YPH499, YPH500 and YPH501 which are 

25 generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Preferred plant host cells and plants include those from Nicotiana 
spp., Solatium tuberosum (potato), Brassica spp. (eg oil seed rape), Beta 
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spp. (eg sugar beet, leaf beet and beetroot). Capsicum spp. and Vanilla 
spp. 



Transformation of appropriate cell hosts with a DNA construct of the 
5 present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
prokaryotic host cells, see, for example, Cohen et al (1972) Proc. Natl. 
Acad, Sci. USA 69, 2110 and Sambrook et al (1989) Molecular Cloning, 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
10 Harbor, NY. Transformation of yeast cells is described in Sherman et al 
(1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also 
useful. With regard to plant cells and whole plants three plant 
transformation approaches are typically used (J. Draper and R. Scott in D. 
15 Grierson (ed.), "Plant Genetic Engineering " , Blackie, Glasgow and 
London, 1991, vol. 1, pp 38-81): 



i) Agrobacterium-medizied transformation, using the Ti plasmid of A . 
tumefaciens and-the Ri plasmid of A. rhizogenes (P. Armitage, R. Walden 
20 and J. Draper in J. Draper, R. Scott, P. Armitage and R. Walden (eds.), 
"Plant Genetic Transformation and Expression - A Laboratory Manual", 
Blackwell Scientific Publications, Oxford, 1988, pp 1-67; R.J. Draper, 
R.Scott and J. Hamill ibid., pp 69-160); 



25 ii) DNA-mediated gene transfer, by polyethylene glycol-stimulated 
DNA uptake into protoplasts, by electroporation, or by microinjection of 
protoplasts or plant cells (J. Draper, R. Scott, A. Kumar and G. Dury, 
ibid., pp 161-198); 
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iii) transformation using particle bombardment (D. McCabe and P. 
Christou, Plant Cell Tiss. Org. Cult. y 3, 227-236 (1993); P. Christou, 
Plant J. , 3, 275-281 (1992)). 

5 Agrobacterium-mzdvaxeA transformation is generally ineffective for 
monocotyledonous plants (eg Vanilla), for which approaches ii) and iii) are 
therefore preferred. In all approaches a suitable selection marker, such 
as kanamycin- or herbicide-resistance, is preferred or alternatively a 
screenable marker ("reporter") gene, such as ^-glucuronidase or luciferase 

io (see J. Draper and R. Scott in D. Grierson (ed.), "Plant Genetic 
Engineering", Blackie, Glasgow and London, 1991, voL 1 pp 38-81). 

Electroporation is also useful for transforming cells and is well known in 
the art for transforming yeast cell, bacterial cells and plant cells. 

15 

For example, many bacterial species may be transformed by the methods 
described in Luchansky et al (1988) Mol. Microbiol, 2, 637-646 
incorporated herein by reference. The greatest number of transformants 
is consistently recovered following electroporation of the DNA-cell 
20 mixture suspended in 2.5 X PEB using 6250V per cm at 25^tFD. 

Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods EnzymoL 194, 182. 

25 Successfully transformed cells, ie cells that contain a DNA construct of 
the present invention, can be identified by well known techniques. For 
example, cells resulting from the introduction of an expression construct 
of the present invention can be grown to produce the polypeptide of the 
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invention. Cells can be harvested and lysed and their DNA content 
examined for the presence of the DNA using a method such as that 
described by Southern (1975) 7. MoL Biol, 98, 503 or Berent et al (1985) 
Biotech. 3, 208. Alternatively, the presence of the protein in the 
5 supernatant can be detected using antibodies as described below. 

In addition to directly assaying for the presence of recombinant DNA, 
successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression 
10 of the protein. For example, cells successfully transformed with an 
expression vector produce proteins displaying appropriate antigenicity. 
Samples of cells suspected of being transformed are harvested and assayed 
for the protein using suitable antibodies. 



15 Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a 
monoclonal (clonally homogeneous) culture, or a culture derived from a 
monoclonal culture, in a nutrient medium; and also, in the case of plant 
cells, a plant derived from, and containing, such cells. 

20 

It is particularly preferred if the host cell comprises a nucleic acid which 
encodes any one of, or combination of, a polypeptide which, in the 
presence of appropriate cofactors if any is capable of catalysing the 
interconversion of mm5-feruloyl SCoA and 4-hydroxy-3- 
25 methoxyphenyl-/3-hydroxypropionyI SCoA (HMPHP SCoA) or a 
polypeptide which, in the presence of appropriate cofactors if any, is 
capable of catalysing the interconversion of 
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4-hydroxy-3-methoxyphenyl-j3-hydroxypropionyl SCoA (HMPHP SCoA) 
and vanillin. 

It will be appreciated that the host cells of the invention or an extract 
5 thereof are particularly suited for use in the methods of the invention. It 
is particularly preferred if the host cell does not contain means for 
converting vanillin to a non-vanillin product. 

It is most preferred if the host cell is a plant cell or is comprised in a 
10 whole plant or a bacterial cell or a yeast cell. Preferred bacterial hosts 
include lactic acid bacteria such as Lactococcus spp. and Lactobacillus 
spp. Preferred yeast hosts include Saccharomyces cerevisiae and its 
biovars. It is particularly preferred if the host cell is a food-grade host 
cell (for example a microorganism which is used or can be used in the 
15 food or beverage industry). It is also preferred if the plant is an edible 
plant. 

It will be appreciated that some host cells or host organisms may already 
contain enzyme activities I, II, III or IV and, in that case, it may be 
20 sufficient, in order to use the host cells in the methods of the invention, 
to introduce into said host cell or host organism one or more 
polynucleotides which encode enzyme activities II, III or IV which encode 
those enzyme activities which are deficient in the host cell or host 
organisms. 

25 

In the case of plants for use in the methods of the invention, it is preferred 
that the relevant gene expression is directed to target organs, tissues and 
subcellular organelles where rrans- feruloyl SCoA, or other appropriate 
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substrates (eg 4-fraAW-coumaroyl SCoA or rnms-caffeoyl SCoA) for the 
enzymes encoded by the transferred genes, are most readily available. 
The stage at which thioesterification with CoASH occurs in plants, in 
relation to the progressive substitution of the phenyl ring which takes place 

5 during the conversion from rra/ty-cinnamate to trans- ferula te, is unclear 
and may be variable (see R. Whetten and R. Sederoff, The Plant CelL 7, 
1001-1013 (1995)). The subcellular localisation or distribution of these 
intermediates during plant phenylpropanoid metabolism also remains 
uncertain; it is likely that they are cytosolic, or that some functional 

10 organisation of the enzymes which metabolise them occurs. The concept 
of the metabolic cluster or "metabolon", in which there is a degree of 
metabolic channelling and free diffusion is restricted, has been proposed 
and discussed (see R.A. Dixon and N.L. Paiva, The Plant CelL 7, 1085- 
1097 (1995) and loc cit.). 

15 

Trans-ferulate, 4-/rans-coumarate and mws-caffeate are normal metabolic 
intermediates. Thus, there may be no requirement to manipulate host 
plants in order to provide /rans-feruloyl SCoA. Their concentrations are 
expected to be influenced in varying degrees by physiological requirements 

20 for a wide range of end-products of the phenylpropanoid pathway, 
including for example lignin, coumarins and flavonoids. There is some 
evidence that the activity of the first enzyme of the phenylpropanoid 
pathway, phenylalanine ammonia lyase (PAL: EC 4.3. 1 .5), can influence 
the accumulation of end-products of the pathway (N. Bate, J. Orr, W. Ni, 

25 A. Meromi, T. Nadler-Hassar, P.W. Doerner, R.A. Dixon, C.J. Lamb 
and Y. Elkind, Proc. NatL Acad. ScL USA, 91, 7608-7612 (1994)) so 
under certain circumstances it is possible to enhance the metabolic effects 
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of the expression of the genes for enzyme activities II and III by 
increasing the expression of PAL. 

It is well known that gene expression in the phenylpropanoid pathway is 
5 responsive to a range of environmental and stress factors, including 
wounding, chemical elicitors of pathogenic origin, and u/v light. The 
mechanisms regulating these responses are not very well understood, 
though several transcription factors have been identified (see R.A. Dixon 
and NX. Paiva, The Plant Cell, 7, 1085-1097 (1995)). However, 
10 particularly when these are more fully characterised, they do or will offer 
opportunities for predictable and inducible control of gene expression, 
particular to enhance the provision of substrate. 

Thus, a further aspect of the invention provides a transgenic plant 
15 comprising a polynucleotide according to any of the fourth or fifth aspects 
of the invention. In other words, the transgenic plant is genetically 
engineered to encode and, preferably, express any one or more of enzyme 
activities II or III. It is particularly preferred that, following said genetic 
engineering the plant is able to produce vanillin from frans-feruloyl SCoA. 
20 It will be appreciated that, depending on the enzymes present in the host 
plant, it may be necessary only to provide a gene encoding only a single 
of said enzyme activities or it may be necessary to provide a gene or 
genes of any two of said enzyme activities. 

25 Conveniently, the transgenic plant is genetically engineered to encode and, 
preferably, express enzyme activities II and III. It can be readily seen that 
the transgenic plants of this aspect of the invention may be used in the 
methods of producing vanillin of the invention, especially when the plant 
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provides the enzyme activity that interconverts trans-femlic acid or a salt 
thereof and t rans- feruloyl SCoA. 

Preferably the plant is a plant which is readily transformed. Preferably 
5 the plant is a plant which is commonly used in agriculture or horticulture 
and more preferably the plant is an edible plant. Advantageously the plant 
is a plant in which it is desirable to introduce a vanilla flavour or aroma. 

Preferred plants include those selected from Nicotiana spp., Solatium 
10 tuberosum^ Brassica spp., Capsicum spp., Beta spp. and Vanilla spp. 

As is described in more detail below the plant may be eaten or may be 
processed into a foodstuff or beverage. Thus, conveniently the transgenic 
plant is processed or prepared so that it is not capable of reproduction or 
15 cultivation, for example the transgenic plant is harvested from the 
environment in which it was grown. 

When vanillin (or the desirable products such as /?-hydroxybenzaldehyde) 
is produced in a host cell or organism of the invention, especially if it is 

20 produced in a transgenic plant of the invention, the vanillin or desirable 
product may initially be present in the form of a glycoside, more 
particularly, a 0-D-glycoside, or, in the case of a carboxylic acid, as 
esters of j3-D-glucose (as occurs in Vanilla pod). In this case, it is 
desirable to release the vanillin (and desirable product) into its uncombined 

25 form, for example by acid- or base-catalysed hydrolysis or by the use of 
glycosides such as the /3-D-glucosidase (emulsin; S. Hestrin, D.S. 
Feingold and M. Schramm, Meth. EnzymoL /, 231-257 (1955); see also 
D. Chassagne, C. Bayonore, J. Crouzet and K. Baumes in "Bioflavour 
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95", eds. P. Etievant and P. Schreier, INRA, Paris, pages 217-222 
(1995).). 



In relation to the use of a microorganism such as Ps. fluorescens biovar 
5 V, strain AN 103 or of a microorganism which has been genetically 
modified to contain enzyme activities II and III (or at least those of these 
activities that it does not normally have), it is preferred that said 
microorganism is provided with rrans-feruloyl SCoA or a means to 
provide said CoA thioester from frons-ferulic acid or a salt thereof at least 
10 in its culture medium. 



Thus, it can be seen from all of the foregoing description that the 
invention includes biochemical and fermentative processes for producing 
vanillin and vanillic acid, recombinant or transgenic plants and the use of 
is the said plants in a method of making vanillin or vanillic acid. 



Typically, in a biochemical process the strain of Pseudomonas (eg Ps. 
fluorescens biovar. V, strain AN103) provides an enzyme system for the 
biotransformation of plant derived /nms-ferulic acid to vanillin and/or 

20 related compounds. Enzyme preparations, whole cells of Pseudomonas or 
a heterologous host organism expressing appropriate Pseudomonas genes 
may be used for this. A variety of mutants of Pseudomonas and various 
additional enzyme preparations, co-factors or co-factor regenerating 
systems may be used. The Pseudomonas enzymes might be overexpressed 

25 in a heterologous host before being extracted and used in a 
biotransformation . 
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Alternatively, but suitably, some form of fermentation process may be 
used which involves the Pseudomonas strain or an appropriate derived 
mutant or a heterologous host organism in which the genes for 
biotransformation are expressed. The chosen microorganism is typically 
5 grown on a ferulate-rich substrate or a substrate comprising rram'-feruloyl 
SCoA. This could generate a vanillin production process. 

In addition the invention includes recombinant microorganisms (eg lactic 
acid bacteria) which are modified to contain genes encoding enzyme 
10 activities II and III. For example, lactic acid bacteria modified according 
to the invention may be used to produce vanilla-flavoured yoghurt 
provided that they are supplied with /rans-feruloyl SCoA. 

Advantageously, genes for vanillin production (such as those encoding 
15 enzyme activities II and III) are expressed in a variety of plant species 
such that vanillin accumulates in an appropriate tissue. In this case a new 
crop plant may be cultivated and vanillin would be extracted. Thus, a 
sugar beet plant may be made according to the invention in which the beet 
was rich in vanillin. In addition the development of a novel plant cultivars 
20 for direct consumption (eg vanilla-flavoured capsicum), or even for their 
desirable aroma properties, included in the invention. 

The polypeptides of the invention, or the genes which encode them, may 
be used either individually or in combination (whether as substantially 
25 isolated polypeptides, or in cell-free extracts or as host cells or organisms 
which encode and, preferably, express said polypeptides) to convert a 
compound into a desirable product. Certain compounds and desirable 
products have been described above. However, the invention also 
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includes the production of any other desirable product, such as a flavour 
or aroma, from substrates related to trans- ferula te and other known 
substrates of the polypeptides (enzymes) of the invention. Thus, the 
polypeptides or genes of the invention, either individually or in 
5 combination, may be used in processes for converting, for example, 
synthetic substrates of the said polypeptides (enzymes) into novel flavours 
and aromas or they may be used to modify the chemical profile of known 
flavours or aromas. 

io Similarly, it will be appreciated that certain desirable products can be 
made by the further action of enzyme activity IV upon certain compounds, 
particularly those produced by enzyme activities I, II and III. For 
example, enzymes activities I, II and III may be used to convert trans-4- 
coumaric acid or rraAzs-4-coumaroyl SCoA to p-hydroxybenzaldehyde and 

15 enzyme activity IV may then be used to convert p-hydroxybenzaldehyde 
to p-hydroxy-benzoic acid. Thus, the invention includes a method of 
producing /^-hydroxy-benzoic acid using at least one of enzyme activities 
I, II, III and IV and advantageously using ail of them. 

20 A further aspect of the invention provides a food or beverage comprising 
a host cell comprising a polynucleotide of the invention, or an extract of 
said host cell. The host cells comprising one or more polynucleotides of 
the invention, especially those such host cells which produce vanillin by 
virtue of the presence of said polynucleotide or polynucleotides, may be 

25 used in the production of food or beverages. In particular, as discussed 
above, lactic acid bacteria which produce vanillin by the methods of the 
invention may be used in the production of cheese, yogurt and related 
products including milk drinks. Similarly, yeasts which produce vanillin 
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by the methods of the invention may be used in the production of food and 
beverages such as bread and beer. Extracts of said host cells may also be 
used in the food or beverage industry. 



5 A still further aspect of the invention provides a food or beverage 
comprising a transgenic plant comprising a polynucleotide of the 
invention, or a part or extract or said transgenic plant. The transgenic 
plant comprising one or more polynucleotides of the invention, especially 
those such transgenic plants which produce vanillin by virtue of the 

io presence of said polynucleotide or polynucleotides, may constitute the food 
itself or they may be processed to form the food or beverage. For 
example, a tuber of a transgenic potato of the invention constitutes a food 
of this aspect of the invention. Alternatively, said transgenic potato may 
be processed into another foodstuff which is, nevertheless, a food of this 

15 aspect of the invention. 



Still further aspects of the invention provide use of Pseudomonas 
fluorescens biovar. V, strain AN 103 or a mutant or derivative thereof in 
a method for producing vanillin, or vanillic acid or salt thereof; use of a 
20 polypeptide of the invention in a method for producing vanillin, or vanillic 
acid or salt thereof; use of a polynucleotide of the invention in a method 
for producing vanillin, or vanillic acid or salt thereof; and use of a host 
cell of the invention in a method for producing vanillin or vanillic acid or 
a salt thereof. 

25 

As is clear from the foregoing the invention also includes a method of 
producing vanillin or vanillic acid, or other related products, the method 
comprising providing /rans-feruloyl SCoA (or any other suitable CoASH 
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thioester which can be acted upon by enzyme activity II) and providing 
enzyme activity II, enzyme activity III and, in the case of vanillic acid 
production or another related product, enzyme activity IV. Trans-feru\oy\ 
SCoA is obtainable by the method described in Example 2 or it may be 
5 obtained using the methods of Zenk et al (1980) Anal, Biochem. 101, 182- 
187, incorporated herein by reference. Other Co ASH thioesters which 
may be substrates for enzyme II are also described in Zenk et al. 

The preferred method steps and organisms for use in the methods, and the 
10 foods and beverages of the earlier aspects of the invention are also 
preferred in this method of the invention to the extent that they are 
compatible with this method, and the organisms used in this method. 

As is discussed above, frans-feruioyl SCoA (and related Co ASH thioesters 
15 such as 4-rm/75-coumaroyl SCoA and rrans-caffeoyl SCoA) are normal 
metabolic intermediates in plants. Thus, a transgenic plant which 
comprises a polynucleotide or polynucleotides which encode, and 
preferably express, enzyme activities II and III in a location in the plant 
which contains rran^-feruloyl SCoA or other suitable CoASH thioester is 
20 particularly suited for the purposes of this aspect of the invention. As is 
described above, such transgenic plants and products derived therefrom 
form part of the invention. 

The invention will now be described in more detail with reference to the 
25 following Examples and Figures wherein: 

Figure 1 describes the vanillin pathway in Pseudomonas fluorescens 
biovar. V, strain AN 103. HMPHP SCoA is 4-hydroxy-3-methoxy- 
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phenyl-/3-hydroxypropionyl SCoA. I is an enzyme that catalyses the 
interconversion of trans-ferulic acid and rra/?j-feruloyl SCoA; II is an 
enzyme that catalyses the interconversion of frarts-feruloyl SCoA and 
HMPHP SCoA; III is an enzyme that catalyses the interconversion of 
5 HMPHP SCoA and vanillin; and IV is an enzyme that catalyses the 
interconversion of vanillin and vanillic acid. 



Figure 2 illustrates the growth of strain AN 103 following transfer to MM 
medium containing 10 mM vanillate (V), 10 mM trans- ferulate (F) or 10 
10 mM trans- ferulate plus 10 mM vanillate (FV). Cultures were previously 
grown in MM medium containing 10 mM vanillate. 



Figure 3 indicates the changes in trans- ferulate and vanillate 
concentrations during growth of strain AN 103 on MM medium containing 
15 10 mM mz/zs-ferulate. 



Figure 4 shows the production of vanillin (van) and vanillate (VA) by an 
extract of cells of strain AN 103 (165 /zg protein) incubated with 
rran.y-ferulate, ATP, CoASH and Mg 2+ ions, both in the absence of NAD + 
20 and in its presence (0.5mM). Cells were grown in the presence of 10 mM 
trans- ferulate, plus 10 mM vanillate. 

Figure 5 demonstrates the formation of feruloyl SCoA, vanillin and acetyl 
SCoA from trans- ferulate supplied to a PDlO-treated extract of 
25 rram-ferulate-grown cells of strain AN 103 ( Ifig protein) in the presence 
of ATP, CoASH and Mg 2+ ions. 
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Figure 6 demonstrates the production of vanillin, acetyl SCoA and feruloyl 
SCoA from HMPHP SCoA supplied to a PDlO-treated cell-free extract 
(7^g protein) of trans- ferula te-grown cells of strain AN 103. 

5 Figure 7 shows the induction over time of frans-ferulate:CoASH ligase 
activity in response to 10 mM frans-ferulate (F), 10 mM vanillate (V) and 
10 mM frans-ferulate plus 10 mM vanillate (FV) present in MM medium. 
The inocula were grown in MM medium plus 10 mM vanillate; growth 
conditions, enzyme extraction and assay were as described in Examples 

io 1 and 2. 

Figure 8 shows SDS-PAGE of A), an extract of cells grown in MM 
medium with 10 mM trans- ferulate, electrophoresed at successive stages 
of purification of the HMPHP SCoA cleavage enzyme; successive stages 
15 are Crude Extract, Mono Q-purified, Mono-P-purified and Phenyl 
Superose-purified, and B), extracts of cells grown in MM medium with 
either lOmM vanillate or lOmM trans- ferulate and electrophoresed 
alongside Mono-P-purified cleavage enzyme; A) silver-stained; B) 
Coomassie-stained. 

20 

Figure 9 shows EcoRI/PstI digests of cosmid clones pFI793, pFI794, 
pFI795 and pFI796 separated on an agarose gel. 

Figure 10 shows the sequence of the redundant primers designed from 20 
25 N-terminal amino residues of the 31-kDal protein (SEQ ID Nos. 5 and 6). 
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Figure 1 1 shows a Southern blot of EcoRI/PstI digests of various cosmid 
clones probed with the PCR product amplified using the N-terminal 
degenerate oligonucleotide primers as shown in Figure 10. 

5 Figure 12 shows the nucleotide sequence of pFI989 (ie the 4370 bp 
EcoRI/PstI fragment from pFI794), together with the succeeding 882bp 
determined from a further subclone, pFI1056 and from pFI794 itself (SEQ 
ID No 7). The amino acid sequence of the 31 kD protein and that 
corresponding to the succeeding open reading frame encoding 

io vanillin: NAD + oxidoreductase (vanillin dehydrogenase) (SEQ ID Nos. 2 
and 4) are also shown. 

Figure 13 shows the nucleotide sequence of pFI901 (ie the 1.8 kb 
EcoRI/PstI fragment from pFI793) (SEQ ID No 8). 

15 

Figure 14 shows the nucleotide sequence of pFI911 (ie the 850 bp 
EcoRI/PstI fragment from pFI793) (SEQ ID No 9). 

Figure 15 shows the nucleotide sequence of pFI912 (ie the 958 bp 
20 EcoRI/PstI fragment from pFI793) (SEQ ID No 10). 

Figure 16 shows the nucleotide sequence of pFI913 (ie the 959 bp 
EcoRI/PstI fragment from pFI793) (SEQ ID No 11). 

25 Figure 17 is a diagrammatic representation of the outward reading primers 
for pFI901 (P35 and P39), pFI911 (P32 and P36), pFI912 (P33 and P37) 
and pFI913 (P34 and P38). 
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Figure 18 is a diagrammatic representation showing the formation of the 
1.5 kb PGR product, using primers P34 and P39, which spans the region 
in the cosmid between the inserts of pFI913 and pFI901. 

5 Figure 19 shows the nucleotide sequence of the merged contigs 
pFI913/PCR product/pFI901 (4259 bp) (SEQ ID No 12). 

Example 1: Isolation and growth of Pseudomonas fluorescens hiovar. 
V. strain AN103 

10 

Experimental 

Growth media 

15 Organisms were grown on the following media: 

Minimal Medium (MM) contained, per 1: KH 2 P0 4 , 5g; (NH 4 ) 2 S0 4 , lg; 

FeS0 4 , 0.5 mg; CaCl 2 , 0.5 mg; MnCi 2 .5H 2 0, 5 mg; (NH 4 ) 6 Mo 2 0 7 .4H 2 0, 

1.1 mg; MgS0 4 , 5 mg; EDTA, 50 mg; ZnS0 4 .7H 2 0, 28 mg; 

CuS0 4 .5H 2 0, 1.6 mg; CoCl 2 .6H 2 0, 1.6 mg. The pH was 7.0. Carbon 
20 sources were included as indicated. 

Tryptone- and yeast-based medium (LBMod) contained, per 1 : tryptone 
(Bacto; Difco, Detroit, USA), 10 g; yeast extract (Bacto), 5 g; NaCl, 10 
g. The pH was adjusted to 7.5. LB Medium was identical with LB Mod, 
25 with the addition of glucose (lg/l). 

Solid media were prepared with the addition of agar ( Difco), 15 g/1. 
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Isolation of Pseudomonas fluoresceins biovar. V f strain AN J 03 

The organism was isolated from surface soil, on the basis of ability to 
grow on rra/is-ferulate as sole carbon source. Initially, a Ig soil sample 

5 was added to 100ml sterile Minimal Medium (MM), containing trans- 
ferulic acid (10 mM). After 2 weeks at 25 °C, with shaking at 200 rpm, 
a sample (100/xl) was removed and added to 200ml of fresh medium 
containing lOmM rr<ms-ferulic acid; this was repeated twice. Serial 
dilution onto solid medium (MM) containing lOmM fra/tr-ferulate as sole 

10 carbon source enabled isolated colonies to be obtained which were replica- 
plated onto MM plates containing individual substrates as carbon sources. 
Several strains able to use rra/is-ferulate as sole carbon source were 
isolated - one (AN 103), which was capable of growing also on vanillin, 
was selected for further work. 

15 

Growth of strain AN 103 

The organism was grown routinely at 25°C on MM, with shaking, using 
vanillic acid (lOmM) or rrans-ferulic acid (lOmM) as sole carbon source; 
20 50ml of medium was used in a 250ml Erlenmeyer flask. Growth was 
monitored by measuring absorbance at 550 or 600nm. 

For long-term storage, bacteria from logarithmic-phase cultures were 
centrifuged and then resuspended in Minimal Medium (MM) containing 
25 50% glycerol. They were then stored at -70°C. Cultures from these 
frozen stocks were reinitiated by transfer onto LB or LB-MOD solid 
medium, followed by inoculation into liquid medium containing lOmM 
trans- ferulic acid. 
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Results 

The organism was isolated from soil samples rich in decayed vegetation 
and was shown to be a strain of Pseudomonas fluorescens using standard 

5 identification techniques. As shown in Table I, the bacterium would grow 
not only on rrans-ferulate as sole carbon source, but also on several 
closely-related compounds, including vanillate, protocatechuate and 
caffeate. Growth on vanillin was observed at low concentrations ( < ImM) 
but was variable; higher concentrations were growth-inhibitory. If the 

10 organism was grown on vanillate, transfer to medium containing trans- 
ferulate as sole carbon source was followed by a lag in the growth curve; 
this was not observed if the transfer was to medium containing both 
vanillate and trans- ferulate (Fig 2). During a growth cycle on 
trans- ferulate, a transient increase in vanillate was observed at around the 

15 time when rnms-ferulate disappearance was maximal (Fig 3), suggesting 
that vanillate was a catabolite of frans-ferulate. A small amount of 
protocatechuate was also observed when the culture medium was examined 
by TLC (not shown). 
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Table I: Relative growth of Ps. fluorescens biovar. V, strain AN103 
on a range of carbon substrates 

Substrates were provided in MM Medium at 10 mM concentration 
5 (vanillin, ImM) and relative growth after 48h at 25 °C was monitored by 
measuring absorbance at 600 nm. 



Substrates 


Relative Growth (°/r>\ 

IwluilT V VJI UTTill y J\J J 


Ferulic acid 


100 


Caffeic acid 


79 


Sinapic acid 


0 


Cinnamic acid 


0 


Vanillin 


<100 


Vanillic acid 


140 


Protocatechuic acid 


77 


Protocatechuic aldehyde 


0 


Glucose 


221 


Acetate 


47 


Methanol 


0 



20 

Example 2: 7>g/is-ferulate metabolism in cell-free extracts and 
mechanism of cleavage 

Experimental 

25 

Chemicals 

Chemicals and biochemicals were routinely obtained from Sigma Chemical 
Co. Ltd, Poole, Dorset, UK, Aldrich Chemical Co. Gillingham, Dorset, 
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UK or BDH-Merck, Poole, Dorset, UK. The synthesis of CoASH 
thioesters is described below. 



Preparation of 4-hydroxy-3-methoxyphenyl-fi-hydroxypropionyl SCoA 
5 (HMPHP SCoA) 



This compound was prepared starting from a Reformatsky condensation 
of vanillin with ethyl bromoacetate (see R.L. Shriner, The Reformatsky 
Reaction in "Organic Reactions", R. Adams, W.E. Bachmann, L.F. 

10 Fieser, J.R. Johnson and H.R. Snyder, eds., vol. 1, pp 1-37, John Wiley, 
New York [1942]), followed by purification of the resulting ethyl 4- 
hydroxy-3-methoxyphenyI-i3-hydroxypropionate (ethyl HMPHP) by 
HPLC, hydrolysis to the free acid, N-succinimidylation and, finally, 
exchange of the A^-succinimidyl group with CoASH and isolation of the 

15 CoASH thioester by preparative TLC (see V. Semler, G. Schmidtberg and 
G.G. Gross, Z. Naturforsch. 42c, 1070-1074 [1987]). 



Vanillin (3g) was mixed with 1.9 ml of ethylbromoacetate and 2g of dry 
Zn dust in 60 ml of dry 1 ,4-dioxane in a round-bottomed flask fitted with 

20 drying tubes and a reflux condenser. The reaction mixture was heatea 
gently to boiling using a heating mantle and refluxed gently for ca.lh. 
After being allowed to cool, the mixture was acidified with 60 ml of 10% 
H 2 S0 4 and extracted with 4 x 120 ml of diethyl ether. The combined 
ether phases were dried with anhydrous Na 2 S0 4 and unreacted vanillin was 

25 removed by washing with 3 x 100 ml of sat. K 2 S 2 O s . The ether phase 
was then rotary evaporated under vacuum at ca. 30°C to remove the 
ether, leaving a liquid residue (or 10ml). This was then applied to a 
preparative C-18 reverse-phase HPLC column (Dynamax 60 A, 8/u,m, 
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250mm x 41mm; Rainin, Woburn, MA, USA) and eluted at 12 ml min 1 
with a gradient of MeOH/H 2 0, containing ImM trifluoroacetic acid. 
[Solvent A comprised 40% MeOH/lmM trifluoroacetic acid; solvent B 
comprised 100% MeOH/lmM trifluoroacetic acid; at time = 0 min, 

5 solvent was 20% B, rising linearly to 40% B at 28 min and 100% B at 35 
min]. Fractions were monitored by absorbance at 280 run and material 
eluting between 37 and 45 min was collected. The solvent was removed 
under vacuum at ca. 35 °C and the remaining material brought to -20 °C 
overnight. The precipitate which formed was then filtered off rapidly and 

io freeze-dried to give 300mg of white substance. This was identified as 4- 
hydroxy-3-methoxyphenyl-0-hydroxypropionic acid ethyl ester (ethyl 
HMPHP) by MS [M]=240 and, on alkaline hydrolysis (1M KOH; 30 
min), gave rise to the free acid. 



15 To generate the N-succinimidyl ester, 30 mg of ethyl HMPHP was 
hydrolysed for 40 min at room temperature in 0.5 ml of 1M KOH. 
Oxalic acid (0.6 ml of 0.5 M) was then added to bring the pH to ca. 3-4. 
The solution was extracted successively with EtoO (5 x ca. 10ml); the 
organic phases were then pooled and evaporated to dry ess. TV - 

20 Hydroxysuccinimide (0.1 mmol; 11.5 mg) was then added in 1.2 ml of 
dry 1,4-dioxane. This was then followed, gradually, by 0.1 mmol (20.7 
mg) of dicyclohexylcarbodiimide in 0.6 ml of dry 1,4-dioxane. The 
reaction mixture was allowed to stand at room temperature for ca. 4h and 
then filtered to remove precipitated DCU, a further 1 .8 ml of dry dioxane 

25 being added to wash the filter. 

The N-succinimidyl ester was not isolated from the reaction mixture but 
was converted in situ into the CoASH thioester. Lithium CoASH (40 mg; 
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ca. 0.05 mmoi) was dissolved in 2.4 ml of 0.1M NaHC0 3 and the 
reaction mixture was added; the exchange reaction was performed under 
N 2 , with stirring, for ca. 2h at room temperature. The pH of the mixture 
was then adjusted to ca.3-4 by the addition of 70/xl of 2.8 M HCI and the 

5 mixture was stored at -70 °C. The Co ASH thioester of HMPHP was 
finally isolated by preparative TLC. Cellulose TLC plates (Avicel; 
lOOOjxm; Analtech, Newark, DE, USA), to each of which was applied 200 
^l of reaction mixture, were developed in nBuOH/ HOAc/ H 2 0 (5/2/3, 
v/v/v). The Co ASH thioester was localised at R F 0.4 - 0.5 using a short- 

io wave u/v lamp and recovered from the plate by scraping and elution with 
50% MeOH. Identification was confirmed by MS [M =960] and by 
hydrolysis to the free acid (cf. ethyl HMPHP) which was measured by 
HPLC and used routinely as the basis for assay. The CoASH thioester 
showed an absorption maximum at 258 nm, and lacked the absorption 

15 maximum at 345 nm characteristic of rra/w-feruloyl SCoA. This molecule 
- and the corresponding ethyl ester and free acid - carry an asymmetric 
centre at the /3-carbon; however, no attempt was made here to resolve the 
optical isomers during or after synthesis. 

20 Preparation of vanilloyl SCoA 

Vanilioyl SCoA was produced from vanillic acid via the iV-succinimidyl 
ester, essentially according to the method described by V. Semler, G. 
Schmidtberg and G.G. Gross (Z. Natursforsch. 42c, 1070-1074 [1987]) for 
25 the synthesis of piperoyl SCoA. 

To a stirred solution of vanillic acid (5 mmol) and N - 
hydroxysuccinimide (5 mmol) in 30 ml of dry 1 ,4-dioxane was added, in 
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small portions, 7.5 mmol of solid dicyclohexylcarbodiimide. The solution 
was stirred overnight at room temperature, and precipitated DCU was 
removed by filtration. The filtrate was evaporated under reduced pressure 
at 40°C and the oily residue dissolved in boiling CHC1 V The N- 
5 succinimidyl vanillate was crystallised from solution by the dropwise 
addition of petroleum ether (b.p.: 30°C - 40° C). Approx 750 mg was 
recovered. 



To generate vanilloyl SCoA from N - succinimidyl vanillate, Co ASH 
10 (sodium salt; 200 mg) was dissolved in 4 ml of 0.1 M NaHC0 3 . N - 
Succinimidyl vanillate (120mg in 4 ml of dioxane) was then added 
gradually over a ca.40 min period at room temperature, sparging with N 2 . 
A further 4 ml of 0.1M NaHC0 3 was then added, together with a further 
8 ml of dioxane. Incubation under N 2 at room temperature, with stirring, 
15 was continued for a further Ih. The pH was then adjusted to ca. 2.8 with 
1M HC1 and the solution was frozen and stored at -70°C. Isolation of 
vanilloyl SCoA was by preparative TLC as described above with n 
BuOH/HOAc/H 2 0(5/2/3, v/v/v) as solvent. Vanilloyl SCoA (R F . 0.5-0.6) 
was identified using a short-wave u/v lamp and recovered by scraping, 
20 elution with 40% MeOH and freeze-drying. Identification was confirmed 
by MS ([M] = 916) and the thioester liberated vanillic acid on alkaline 
hydrolysis. 

Preparation of trans-feruloyl SCoA and other cinr.amoyl SCoA thioesters 

25 

7ra/u-feruloyl SCoA was prepared from rrans-ferulic acid via the N - 
succinimidyl ester, as described above for vanilloyl SCoA. Final isolation 
was achieved similarly by preparative TLC and elution, identification 
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being confirmed by MS and by alkaline hydrolysis to free trans-ferulic 
acid. Caffeoyl and /?-coumaroy 1-SCoA thioesters were prepared similarly. 

Preparation of cell-free extracts 

5 

Cell-free extracts of logarithmic-phase cultures (6-10h after inoculation) 
were prepared by sonication. Cells from ca. 200ml of medium were 
pelleted by centrifugation, and resuspended in 5- 10ml of Extraction Buffer 
(routinely 40mM KPi; pH 7.2, containing lOmM dithiothreitol). They 

10 were then sonicated (MSE Soniprep 150; Fisons Instruments, Crawley, 
Sussex, UK) at 4°C (5 x 20s; 22 Amplitude microns on full power), and 
centrifuged (20 000 x g\ 20 min; 4°C). Extracts were routinely stored 
frozen at -70°C and in some instances buffer-changed using a PD10 
column (Pharmacia) before use. The protein contents of extracts were 

15 variable - between 0.25 and 1.8 mg/ml. 

Incubation of cell-free extracts 

Cell-free extracts were routinely incubated at 30°C and pH 7.5 in a 
20 reaction mixture (1ml) containing 90mM Tris HC1 buffer and 2.5mM 
MgCl 2 , together with (as appropriate) 0.5mM frans-ferulic acid, 0.2mM 
Co ASH (Li salt) and 2.5 mM ATP. This complete reaction mixture 
constituted an assay for rra/w-ferulate: CoASH ligase, where the initial 
increase in absorbance at 345 nm was monitored against a blank reaction 
25 mixture from which CoASH was omitted. Incubations with HMPHP 
SCoA (generally 0.4mM) were performed similarly, but with the omission 
of rraw-ferulic acid, CoASH and ATP. 
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Vanillin: NAD + oxidoreductase was assayed at 30°C and pH 7.0 by 
monitoring the initial decrease in absorbance at 340 nm against a blank 
cuvette from which NAD + was omitted. Because of the similarity in 
extinction coefficient at 340nm for vanillin and for NADH, the sensitivity 
5 of the assay was increased by catalysing the regeneration of NADH to 
NAD + by providing lactate dehydrogenase and pyruvate. Reaction 
mixtures contained, in 1ml volume, 75 mM KPi buffer, pH 7.0, 0. 125mM 
vanillin, 1 .2 mM Na pyruvate, lactate dehydrogenase (rabbit muscle), 1 . 1 
U and NAD + , 0.5 mM. 



10 



HPLC analysis 



Metabolites of rrans-ferulic acid, including the CoASH thioesters, were 
analysed and quantitated by HPLC using a Lichrosorb RP-18 column (20 

15 cm x 4.6 mm; Capital HPLC, Broxburn, West Lothian, UK) with a 
multiphasic gradient; solvent A" was 20mM NaOAc, adjusted to pH 6.0 
and solvent B" was MeOH; the flow rate was 1 .2 ml/min; the proportion 
of solvent B" rose linearly from 0% at 0 min to 10% at 15 min and 
thence to 50% at 40 min and 70% at 45 min, finally decreasing to 0% at 

20 50 min. Detection was with a Spectra Focus detector (Thermo Separation 
Products, Stone, Staffs. UK), which permitted u/v spectral analysis of 
each eluting component. 

Typical approximate retention times were: CoASH, 3 min; vanillic acid, 
7 min; trans-ferulic acid, 19 min; acetyl SCoA, 22 min; HMPHP SCoA, 
25 29 min; vanilloyl SCoA, 31 min; vanillin, 31 .5 min; rrans-feruloyl SCoA, 
34 min. 
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Mass spectrometry 

Mass spectra ( + -ve and -ve ion) were recorded on a MS 9/50 mass 
spectrometer (Kratos Instruments, Manchester UK), using xenon fast atom 
5 bombardment (FAB) at a potential of 5-7 kV using glycerol as matrix (see 
G.R. Fenwick, J. Eagles and R. Self, Biomedical Mass Spectrometry 10, 
382-386 (1983)). 

Protein assay 

10 

Protein was assayed by the method of M.M, Bradford {Anal. Biochem. 
72, 248-254 (1976)), using Bio-Rad dye reagent (Bio-Rad Laboratories, 
Richmond, CA, USA) and bovine serum albumin as standard. 

15 Results 

Crude extracts of Ps. fluorescens biovar. V, strain AN103, from cells 
grown on vanillate together with ?ra/*s-ferulate, were able to produce 
vanillate when supplied with ?r<2/?s-ferulate, CoASH, ATP, Mg 2+ ions and 
20 NAD + . In the absence of NAD + , vanillate was not formed and vanillin 
accumulated in its place (Fig 4). The quantity of vanillin formed in the 
absence of NAD + was smaller than the amount of vanillate formed in its 
presence. Essentially no vanillin accumulated in the presence of NAD*. 

25 This utilisation of frans-ferulate by crude extracts was dependent upon 
CoASH and ATP and partially upon Mg 2+ ions (Table II). These 
properties indicate an initial activation of trans- ferulate to rrans-feruloyl 
SCoA by fAWtf-ferulate: CoASH ligase. This was further shown by the 
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rapid development of a CoASH-dependent absorbance maximum at 345 
nm and particularly by a transient bathochromic shift, causing the 
appearance of a yellow colour, if the reaction mixture was made alkaline 
with NaOH (data not shown). In the initial stages of the overall reaction, 
5 the linear increase in absorbance at 345 nm enabled the activity of 
rrans-ferulate: CoASH ligase to be assayed directly. During the later 
stages, however, absorbance at 345 nm would be contributed by both 
/rarts-feruioyl SCoA and vanillin or, in the presence of NAD + , NADH, 
each of which has substantial absorbance at this wavelength. 

10 

Table II: Cof actor requirements for /ra/is-ferulate utilisation by P. 
fluorescens biovar. V, strain AN 103 cell-free extracts 



Reaction mixtures (165 tig protein) were incubated for 4h at 30 °C as 
15 described in Experimental, with omissions from the complete reaction 
mixture as indicated. 



Reaction 
Mixture 


Reaction products (nmol) 


7ra/is-ferulate 
(remaining) 


Vanillin 


Vanillate 


Complete 


267 


n.d. 


311 


-CoASH 


550 


n.d. 


40 


-NAD + 


258 


228 


23 


-ATP 


513 


n.d. 


33 


-Mg 2+ 


425 


n.d. 


153 



25 n.d. = not detectable 

An overall non-oxidative cleavage of trans- feruloyl SCoA is implied in Fig 
5 which shows, in the absence of NAD", an equivalence between the 
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formation of vanillin and that of acetyl SCoA. (The formation of [2- l3 C] 
acetyl SCoA from trans- ferula te l3 C-labelled in the or-carbon atom was 
also confirmed by NMR spectroscopy (not shown).) The cleavage 
mechanism was investigated further by synthesising chemically the 

5 hydrated derivative of /ra/w- feruloyl SCoA, 4-hydroxy-3-methoxy- 
phenyl-jS-hydroxypropionyl SCoA (HMPHP SCoA). This was incubated 
with cell-free extract and shown to be converted rapidly to acetyl SCoA 
and vanillin, in equimolar proportions (Fig 6). A smell of vanillin was 
obtained when HMPHP SCoA was used as a substrate. This hydrated 

10 intermediate was not only metabolised in the forward direction, however, 
since an almost equivalent back reaction to feruloyl SCoA (putatively 
trans) was also observed. The rapidity of utilisation of HMPHP SCoA 
was consistent with the failure to observed its accumulation, using HPLC, 
during cell-free incubations with /ra/u-ferulate, CoASH, ATP and Mg 2 * 

15 ions. This cleavage of HMPHP SCoA in the absence of NAD + indicated 
no intervening j3-oxidation to the /3-keto thioester 
(4-hydroxy-3-methoxy benzoyl) acetyl SCoA (cf. M.H. Zenk, AnaL Z 
Pflanzenphysiol 53, 404-414 (1965)). Attempts to prepare this compound 
for cell-free studies were unsuccessful, but its expected cleavage product. 

20 vanilloyl SCoA, was prepared and shown not to be metabolised to vaniiiin 
by cell-free extracts in the presence of NADH, even when simultaneous 
incubations with frans-ferulate in the absence of NAD + actively produced 
vanillin. 

25 Besides trans-ferulic acid, caffeic acid and /?-coumaric acids were 
converted to thioesters of CoASH by crude extracts of Ps. fluorescens 
biovar. V, strain AN 103 (Table III). 
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Table HI. Formation of CoASH thioesters of /rans-/?-hydroxycinnamic 
acids by crude extracts of Ps. fluorescens AN 103. The activity was 
assayed spectrophotometrically as described for /rans-ferulate: CoASH 
ligase, measuring the initial rate of increase in absorbance at 345 nm in 
5 the case of trans-feruloyl SCoA formation, and at the corresponding 
absorbance maxima for the other SCoA thioesters. 



Substrate 


Activity (nkat/mg protein) 


Ferulate 


0.50 


Caffeate 


0.39 


p-Coumarate 


0.37 



Example 3: Mutants in rrg/is-ferulate metabolism 



L5 Experimental 



Mutagenesis of Pseudomonas fluorescens biovar. V, strain AN 103 

Ethyl methanesulphonate (EMS) was used for mutagenesis. Bacteria were 
20 grown for 2 d at 25* C in minimal medium (MM) with vanillic acid as 
carbon source; 1 ml of culture was then inoculated into 50 ml of LB-MOD 
and grown for 16 h at 25 °C. The cells were centrifuged and resuspended 
in 0. 1M KH 2 P0 4 (1 .25 ml) to give a cell density of 4 xlO 9 cells/ml (OD 580 
of 1.0 = 6 x 10 7 cells/ml). An aliquot of this cell suspension was serially 
25 diluted (10 2 -10 8 ) and plated onto LB-MOD plates (0.1 ml per plate) to 
provide control cell counts for assessment of the efficiency of 
mutagenesis. 
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A cell suspension (1 ml) was incubated with 0.08 ml of EMS in a total of 
3 ml of 0.1M KH 2 PO, at 37°C for 45 min. The cells were then 
precipitated by centrifugation at 4°C and the cell pellet was washed twice 
with 10 ml of LB-MOD medium, prior to resuspension in 1 ml of this 
5 medium. An aliquot was serially diluted (10 MO' 8 ) and plated onto LB- 
MOD plates; these were incubated at 25°C overnight, together with the 
plates of the unmutagenised cells, to obtain an estimate of kill (70% kill 
indicates efficient mutagenesis). The remaining mutagenised cells (0.9 ml) 
were inoculated into LB-MOD medium (50 ml) and incubated overnight 
10 at 25 °C. The mutagenised cells were then enriched for mutants in trans- 
ferulate utilisation by treatment with carbenicillin in minimal medium 
(MM) in the presence of rrans-ferulic acid. The cells were harvested by 
centrifugation at 4°C, washed twice with MM (10 ml) and resuspended in 
20 ml of MM. A sample (1 ml) was inoculated into MM (15 ml) and 
15 incubated at 25 °C for 1 h; then rrans-ferulic acid (lOmM final 
concentration) and carbenicillin (2 mg/ml final concentration) were added. 
A control flask was prepared containing trans-ferulic acid, but not 
carbenicillin. Both flasks were incubated overnight at 25 °C for 16 h, 
monitoring OD 580 to estimate growth and confirm the effectiveness of the 
20 antibiotic. Penicillinase (10 units) was then added to destroy the 
carbenicillin, incubating overnight at 25 °C. The cells were harvested by 
centrifugation at 4°C, washed twice in MM (10 ml) and resuspended in 
5 ml of MM; 1 ml of these resuspended cells were then inoculated into 50 
ml of MM containing lOrnM vanillic acid and incubated at 25 °C for ca, 
25 24 h. 

These enriched cells were screened by replica-plating for mutants unable 
to use trans- ferulic acid. The enriched stock was diluted to 10^ , plated 
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onto LB-MOD (0.1 ml per plate), incubated at 25 °C for 2 d and then 
replica-plated onto MM containing lOmM vanillic acid or lOmM trans- 
ferulic acid. The plates were incubated at 25 °C for 2-3 d and screened for 
colonies able to grow on vanillate but unable to grow on fra/?5-ferulate. 

5 

Results 



By mutagenesis of strain AN 103 with ethyl methane sulphonate, two 
classes of mutants unable to utilise /ra/zs-ferulate as sole carbon source 

10 were isolated; these were van 1, van 2 and van 3 and, secondly, van 10 
and van 11. Following growth on vanillate plus fra/w-ferulate, a 
representative of the first of these, van 1, showed no activity in cell-free 
incubations with either rra/is-ferulate or vanillin and lacked both 
/r<ms-ferulate: CoASH ligase and the enzyme that converts vanillin to 

15 vanillate, vanillin: NAD + oxidoreductase. In contrast, the type 
representative of the second class, designated van 10, possessed levels of 
activity of both fnmr-ferulate: CoASH ligase and vanillin: NAD* 
oxidoreductase similar to those found in strain AN 103, but in the presence 
of NAD* generated very little vanillate (Table IV). Cell-free extracts of 

20 van 10 were examined further for their ability to metaoolise HMPHP 
SCoA. They metabolised this thioester actively, but appeared 
predominantly to dehydrate it to feruloyl SCoA; vanillin formation was 
substantially inhibited in comparison to the AN 103 extract (Table V). 
These observations suggested that van 1 was a regulatory mutant, 

25 defective in its induction by rra/u-ferulate, whilst van 10 appeared to be 
defective in the HMPHP SCoA cleavage activity. 
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Table IV: Trans-ferulate metabolism in cell-free extracts of P. 
fluorescens biovar* V, strain AN103 and of mutant strains van 1 and 
van 10. 

5 Cells were grown for 6h in MM medium containing 10 mM vanillate 
together with 10 mM rra/u-ferulate. Extracts were then prepared as in 
Example 2 and assayed for /ra^-ferulate;CoASH ligase and 
vanillin: NAD + oxidoreductase. Extracts (ca. 0.3 mg of protein) were also 
incubated for 4h in the presence of NAD + to determine relative amounts 

io of vanillate formed. 



Strain 


Enzyme activity (nkat mg 1 protein) 


Vanillate formed 


Ferulate: CoASH 
ligase 


Vanillin: NAD 
oxidoreductase 


(nmol mg 1 protein) 


AN 103 


1.7 


1.2 


807 


van 1 


0 


-0.05 


0 


van 10 


2.0 


1.4 


59 



Table V: Utilisation of HMPHP SCoA by extracts of Ps. fluorescens 
biovar. V, strains AN103 and van 10 

20 Extracts (AN 103, 14 ^tg protein; van 10, 68 fig protein) were incubated 
at 30°C for 7 min in 1 ml vol containing 0.3 mM HMPHP SCoA. The 
increase in absorbance at 345 nm was measured against a blank reaction 
mixture containing no extract. Vanillin formation was measured by 
HPLC; the production of feruloyl SCoA was in this instance calculated 

25 from the increase in absorbance at 345 nm, after subtraction of the 
contribution from vanillin. 
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Strain 


AA 34S 


Feruloyl SCoA 
(nmol) 


Vanillin (nmol) 


AN103 


0.75 


18.4 


23.8 


van 10 


0.63 


29.2 


4.5 



5 Example 4: Induction of //wis-ferulate metabolism in strain AN103 
and purification of HMPHP SCoA cleavage enzvme 

Experimental 

10 Purification of trans-feruloyl SCoA hydratase/aldol cleavage enzyme 

Cells (from 2 1 of culture grown for 72 h on MM with lOmM rrans-ferulic 
acid as substrate; OD 565 ca. =0.5) were extracted essentially as described 
in Example 2 to give 50 ml of crude extract, containing L28 mg of 
15 protein/ ml. 

Extract (16 ml, diluted to 40 ml), was applied at room temperature and 2 
ml/min to a Mono Q HR10/10 anion-exchange column (Pharmacia, 
Piscataway, NJ, USA), preequilibrated with 20 mM Tris buffer (pH 7.5) 
20 containing 10 mM dithiothreitol. After elution of unadsorbed protein, 
protein bound to the column was eluted with a linear gradient of 
increasing NaCl concentration: from 0 to 0.5M NaCl in 100 ml of buffer. 

Fractions eluting between 0.18 and 0.3 M NaCl and containing activity 
25 with HMPHP SCoA, as determined using the microtitre plate assay (see 
below), were pooled and buffer-changed by dialysis into 25 mM bis-Tris 
buffer, containing 10 mM dithiothreitol and adjusted to pH 7.1 with 
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iminodiacetic acid. They were then applied at 0.75 ml/min to a Mono P 
HR 5/20 chromatofocusing column (Pharmacia), preequilibrated with the 
same buffer. After eluting unadsorbed protein from the column, adsorbed 
protein was eluted with a gradient of decreasing pH, generated by 
5 applying 46 ml of 10% (v/v) Polybuffer 74 (Pharmacia), containing 10 
mM dithiothreitol and adjusted to pH 4.0 with iminodiacetic acid. 
Activity with HMPHP SCoA was eluted between pH 5.5 and 5.1. 

The active fractions were again pooled together, and buffer-changed into 
20 mM Tris buffer (pH 7.5), containing 1.7 M (NH 4 ) 2 S0 4 and 10 mM 
dithiothreitol, using PD10 columns (Pharmacia), before application at 0.5 
ml/min to a PhenylSuperose HR 5/5 hydrophobic interaction 
chromatography column (Pharmacia) preequilibrated with this buffer. 
Elution of bound protein was achieved with a decreasing gradient of 
(NH 4 ) 2 S0 4 in buffer, from 1.7 M to zero over 30 ml and then continuing 
with buffer alone for a further 5 ml. Activity with HMPHP SCoA was 
eluted in this final 5 ml of buffer. 

At each stage of purification, active fractions were detected by a micro- 
20 adaptation of the assay with HMPHP SCoA described above (Example 2); 
reactions were performed in 100^1 of reaction mixture for ca. 4 min at 
room temperature in microtitre wells and absorbance was then measured 
in an MR 5000 microtitre plate reader (Dynatech, Guernsey, Channel 
Islands), equipped with a 340 nm filter. The activity of the pooled 
25 fractions was measured using HPLC to determine the reaction products of 
both HMPHP SCoA, (0.4mM) and rra/25-feruloyl SCoA (0.28mM) as 
substrates. Reaction mixtures containing 10/il of enzyme were incubated 
in 100 fx\ volume as described above; the reaction was terminated with 



10 



15 



BNSDOCI0: <WO 9735999A2J_> 



WO 97/35999 PCT/GB97/00809 

71 

100 m! of acidified MeOH (pH 3) after 2 min (HMPHP SCoA) or 5 min 
(/ra/is-feruloyl SCoA) of incubation at 24°C. The proportionality of the 
reactions with time and with quantity of enzyme was established in 
preliminary determinations. 

5 

Samples (10/d)of enzyme at each stage of purification were analysed by 
SDS-PAGE, with Coomassie or silver staining, essentially as described by 
H. Schagger and G. von Jagow, Anal. Biochem. 166, 368-379 (1987). An 
Atto AE6450 gel apparatus was used (supplied by Genetic Research 
10 Instrumentation, Dunmow, Essex, UK). 



Electrocution of protein bands from fixed, stained gels was performed 
using a Bio-Rad Model 422 electroeluter according to the manufacturer's 
directions. Eluted protein was then deposited by centrifugation onto a 
15 Pro-Spin membrane (Applied Biosy stems, Foster City, CA, USA) used in 
accordance with the manufacturer's recommendations). N-Terminal 
sequencing was performed by Alta Bioscience, University of Birmingham, 
Birmingham, UK. 



20 Results 



The time-course of induction of /ra/2s-ferulate:CoASH ligase in strain 
AN 103 is shown in Fig 7. Following transfer of vanillate-grown cells to 
medium containing rrans-ferulate, the specific activity of the ligase in 
25 extracts increased approximately linearly over an 8-hour period. An 
essentially identical time-course was obtained if the cells were transferred 
instead to medium containing both fraAW-ferulate and vanillate. Vanillate, 
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a catabolite of /raws-ferulate, therefore did not inhibit the induction 
process. 

Induction of the capacity to grow on a different substrate represents a 
5 significant shift in primary metabolism, which in principle might be 
detectable by electrophoresis of a crude protein extract. Cell-free extracts 
analysed by SDS-PAGE, with Coomassie staining (see Experimental), did 
indeed show a distinct difference in protein banding between 
/ra/w-ferulate-grown cells and vanillate-grown cells (Fig 8) . Extracts from 
10 frans-ferulate-grown cells exhibited a new, or very strongly enhanced, 
band corresponding to a polypeptide of molecular weight ca. 31 kD. 

N-Terminal amino acid sequencing of this polypeptide, following its 
removal from the gel by electroblotting, gave the following sequence: 
15 Ser-Thr-Tyr-Glu-Gly-Arg-Trp-Lys-Thr-Val-Lys-Val-Glu-Ile-Gln-Asp-Gly- 
He-Ala-Phe (SEQ ID No 13). 

Purification of HMPHP SCoA-utilising activity was achieved by Fast 
Protein Liquid Chromatography (FPLC - Pharmacia). As described 
20 above, fractions were screened for activity with HMPHP SCoA using a 
microtitre plate scanner; active fractions were then pooled together and 
activity was then measured both with HMPHP SCoA and with trans- 
feruloyl SCoA, determining the reaction products by HPLC. 

25 The results of the purification are given in Tables VI and VII. Vanillin 
and acetyl SCoA were produced in approximately equimolar amounts, 
throughout the purification, with either HMPHP SCoA or /ra/is-feruloyl 
SCoA as substrate. There was an approximate copurification of the 
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activities with trans - feruloyl SCoA and HMPHP SCoA as substrate, 
including the formation of feruloyl SCoA from HMPHP SCoA 
(dehydratase reaction; reverse of reaction II): vanillin-forming activity 
from fra/w-feruloyl SCoA (reactions II -fill) was purified lL5-foId, 
5 vanillin-forming activity from HMPHP SCoA (reaction III) was purified 
1 1 .7-fold and the dehydratase reaction (reverse of reaction II) was purified 
9,1-fold. Approximately 20-25% of each of these activities was finally 
recovered. 

10 Table VI: Purification of frans-feruloyl SCoA hydratase/aldol cleavage 
enzyme. 

Purification from cells of Ps. fluorescens biovar V, strain AN 103, was 
undertaken as described in Experimental, measuring activity - vanillin as 
15 product - with both rra/w-feruloyl SCoA and HMPHP SCoA as substrates. 
Values in parentheses show activity measuring acetyl SCoA as product. 



Part 1 



Purification 
stage 


Total activity (nkat) 


Total 
protein 

(mg) 


Specific activity 
(nkat/mg) 


trans- 

feruloyl 

SCoA 


HMPHP 

SCoA 


trans- 

feruloyl 

SCoA 


HMPHP 

SCoA 


Crude 
Extract 


85.2 
(68.2) 


59.2 
(54.9) 


20.5 


4.16 


2.89 


Mono Q 
Fractions 


80.6 
(66.2) 


45.1 
(42.9) 


4.37 


18.4 


10.3 


Mono P 
Fractions 


56.9 
(51.9) 


35.4 
(33.4) 


1.61 


34.1 


21.2 


Phenyl 

Superose 

Fractions 


22.1 
(19.8) 


15.5 
(15.1) 


0.46 


48.0 


33.7 
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Part 2 



Purification 
stage 


Ratio of 
activities 


Purification (fold) 


Recovery (%) 


trans- 

feruloyl 

SCoA 


HMPHP 
SCoA 


trans- 

feruloyl 

SCoA 


HMPHP 

SCoA 


i^ruoe 
Extract 


1 A A 

1 .44 


1.00 


1.00 


100 


100 


Mono Q 
Fractions 


1.79 


4.42 


3.56 


94.6 


76.2 


Mono P 
Fractions 


1.61 


8.20 


7.34 


66.8 


59.8 


Phenyl 

Superose 

Fractions 


1.42 


11.5 


11.7 


25.9 


26.2 



15 Table VII: HMPHP SCoA dehydratase activity during purification of 
frans-feruloyl SCoA hydratase/aldol cleavage enzyme 



Conditions and other data as Table VL Dehydratase reaction measured as 



feruloyi SCoA production. 



Purification 
stage 


Total 
activity 
(nkat) 


Specific 
activity 
(nkat/mg) 


Purification 
(fold) 


Ratio to 
HMPHP SCoA 
Cleavage 
Activity* 


Crude 
Extract 


171 


8.34 


1.00 


2.89 


Mono Q 
Fractions 


121 


27.7 


3.32 


2.69 


Mono P 
Fractions 


101 


60.5 


7.25 


2.85 


Phenyl 

Superose 

Fractions 


34.9 


75.9 


9.10 


2.25 



* see Table VI. 
1 
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SDS-PAGE of the combined active fractions at each stage revealed the 
enhancement of a 31 kD protein band (Fig. 8), indicating purification to 
apparent homogeneity after chromatography on PhenylSuperose 
(Pharmacia). This band co-migrated with the band associated with growth 
5 of strain AN 103 on /rans-ferulate and gave the same N-terminal amino- 
acid sequence: Ser-Thr-Tyr-Glu-Gly-Arg-Trp (SEQ ID No 14). 

Definitive proof of the catalysis of both reactions II and III by this protein 
was achieved as a result of expression of the gene in Escherichia coli, (see 
10 Example 5). 

The Mono-P-purified enzyme was able to accept, as alternative substrates 
to /ra/u-feruloyl SCoA, /rans-caffeoyl SCoA and /r<m?-4-coumaroyl SCoA 
(Table VIII). 

15 

Table VIII . Utilisation of rrarts-p-hydroxycinnamoyl SCoA thioesters by 
trans- feruloyl SCoA hydratase/aldol cleavage enzyme. Activity was 
determined at 30°C and with 0.4 mM substrate using enzyme from Ps. 
fluorescens AN103 (2.8 fig of enzyme protein, partially purified by Mono- 
20 Q and Mono-P chromatography) as described in Experimental. 



Substrate 


Activity (nkat/mg of protein) 


Feruloyl SCoA 


0.60 


Caffeoyl SCoA 


0.36 


p-Coumaroyl SCoA 


0.72 
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Example 5: Isolation of the genes required for the conversion of trans- 
ferulovi SCoA to vanillic acid (vanillate) in Pseudomonas fluorescens 
strain AN103 

5 A strain of Pseudomonas fluorescens (biovar. V, AN 103) was isolated 
from soil at the Institute of Food Research, Norwich Laboratory, which 
was able to grow on rrans-ferulic acid converting it to vanillic acid via 
vanillin. The proposed biochemical pathway for the conversion of trans* 
ferulic acid to vanillic acid shown in Fig 1 was substantiated in the 

10 experiments described above in Experiments 2-4. 

In order to clone the genes required for the conversion of rram-feruloyl 
SCoA to vanillic acid the strategy of complementing mutant derivatives of 
Ps. fluorescens AN 103 that were unable to grow on fra/w-ferulate as sole 
15 carbon source was used. The isolation and characterization of mutants is 
described above in Example 3 and mutants van 10 and van 11 were used 
for clone isolation. As described in Example 3, these mutants appeared 
to be defective in a gene involved in the conversion of trans-femloyl 
SCoA to vanillin. 

20 

A genomic library of Ps. fluorescens AN 103 DNA was prepared in the 
mobilisable cosmid cloning vector pLAFR3 (B. Staskawicz, D. Dahlbeck, 
N. Keen, and C. Napoli, J. Bact. 169, 5789-5794 (1987)). Genomic 
DNA was isolated from Ps. fluorescens AN 103 and partially digested 
25 with Sau 3A1 at 37 °C for 7-10 min. The DNA was then size- fractionated 
on a NaCl gradient (1.25-5M). The fraction containing DNA of 20-40 kb 
was selected and 0.5 /xg ligated into the dephosphorylated Bam HI site of 
the broad- host-range cosmid cloning vector, pLAFR3. One half of the 
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ligation mix was packaged into bacteriophage lambda particles using a 
Gigapack II XL kit (Stratagene, La Joila, CA, USA). The packaged 
cosmids were transfected into Escherichia coli strain 803 (W.B. Wood, J. 
MoL BioL 16, 118-133 (1966)). Approximately 10,000 primary 
5 transfectants were obtained. The lawn of cells obtained was washed from 
the selection plates and glycerol-containing stocks prepared for storage at 
-70°C. 



The genomic library of Ps. fluorescens AN 103 DNA in cosmid pLAFR3 

to was introduced into the two mutant Ps. fluorescens derivative strains 
vanlO and vanll using the helper plasmid, pRK2013 (D. Figurski and 
D.R. Helinski, Proc Natl. Acad. Sci. USA 76, 1648-1652 (1979)). The 
mutant strains were inoculated into minimal medium MM containing 
lOmM vanillic acid and incubated at 25 °C for 2 days. The Escherichia 

15 coli strain carrying the helper plasmid (E.coli 803pRK2013) was 
inoculated into LB-Mod medium (10 ml) and incubated at 37°C for 6 h. 
At the same time, 0.1 ml of the glycerol-containing stock of the AN 103 
genomic library was similarly inoculated and incubated. The growth of all 
three cultures was monitored by measuring OD^ and appropriate volumes 

20 combined in a centrifuge tube to give equal populations of the three 
organisms. The mixture of cells was centrifuged, resuspended in a 
minimal volume of the supernatant solution and spread over a sterile 
gridded cellulose nitrate membrane filter (47 mm diam., Whatman, 
Maidstone, Kent, UK) on a moist LB-Mod agar plate. The suspension 

25 was allowed to air-dry onto the filter for a few minutes and then incubated 
overnight at 25 °C. The bacteria were then washed from the filter using 2 
ml of H 2 0 and aliquots (0. 1 ml) were applied to selection plates consisting 
of MM agar with 10 mM vanillic acid and 5 jxg/ml tetracycline. These 
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were incubated at 25 °C for 2 days and the colonies obtained ( > 1000 per 
plate) were replica-plated to similar plates containing trans-feruhc acid in 
place of vanillic acid; these were incubated similarly. Colonies (2-3 per 
plate) able to grow on the plates containing trans-femlic acid were selected 

5 and inoculated into fresh MM medium containing 10 mM rrans-ferulic 
acid and 5 /xg/ml tetracycline. Four such isolates in which the mutation 
in the Ps. fluorescens strains vanlO and vanl 1 was complemented by the 
introduced cosmid were selected for further analysis. These strains were 
purified and the cosmid DNA was extracted by the mini-preparation 

io method of F.G. Grosveld, H.H.M. Dahl, E. Deboer and R. A. Flavell 
{Gene 13, 227-231 (1981)). The cosmid DNA was transformed into E. 
coli strain 803 and was again isolated as described by D.S. Holmes and 
M. Quigiey (Anal. Biochem. 114, 193-197 (1981)). Two of the cosmid 
clones, pFI 793 and pFI 794, were isolated as complementing Ps. 

15 fluorescens mutant strain van 10, whereas cosmid clones pFI 795 and pFI 
796 complemented Ps. fluorescens mutant strain van 1 1 . 

To test whether the plasmid clones pFI 793, pFI 794, pFI 795 and pFI 
796 would complement any of the other Ps. fluorescens mutants, each 

20 plasmid was introduced into Ps. fluorescens mutant strains vanl, van2, 
van3, vanlO and vanll. As described above in Example 3 the mutant 
strains vanl , van2 and van3 appear to be defective in a regulatory function 
that eliminates at least two different enzyme activities. The Ps. 
fluorescens van mutants were grown on MM04- 10 mM vanillic acid agar 

25 medium for two days at 25 °C. The strain carrying the helper plasmid 
(803pRK2013) was grown on LB-Mod agar + kanamycin (25 ^g/rnl) at 
37 °C overnight. The E.coli 803 cosmid clones carrying pFI 793, pFI 
794, pFI 795 and pFI 796 were grown on LB-Mod agar medium + 
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tetracycline (5 fig/ml) at 37°C overnight. The bacteria were patch mated 
(ie a looprul of donor, recipient and helper strain were mixed together on 
LB-Mod agar and incubated overnight at 25 °C). The bacteria were 
replica plated onto selection medium (MMO + 10 raM rranj-ferulic acid 
5 + tetracycline and MMO + 10 mM vanillic acid + tetracycline) and 
incubated at 25 °C for two days. All four cosmid clones pFI 793, pFI 
794, pFI 795 and pFI 796 complemented all of the van mutants (vanl, 
van2, van3, vanlO and vanl 1) enabling them to grow on fra/w-ferulate as 
sole carbon source. 

10 

The cosmid clone DNAs were analyzed by digestion with restriction 
endonucleases Hind III and Eco RI to reveal insert DNA. Each of the 
four clones pFI 793, pFI 794, pFI 795 and pFI 796 carried inserts of 
between 20 and 30 kb. The four cosmid clones gave distinct restriction 

15 patterns, but appeared to share some restriction fragments of the same 
size. To identify restriction fragments that were common to the cosmid 
clones cosmid pFI 793 was used as a probe against DNA of all four 
cosmid DNA preparations that had been double-digested with restriction 
endonucleases Eco RI and Pst I. Cosmid DNA was isolated using Qiagen 

20 midi columns according to the manufacturers instructions and was digested 
with restriction endonucleases £coRI and Pstl. The resulting fragments 
were separated by agarose gel electrophoresis and Southern blotted to a 
Hybond-N filter as described by Sambrook et al (Sambrook, J., Fitsch, 
E.F. and Maniatis, T. Molecular Cloning: A Laboratory Manual, 2nd 

25 edition, Cold Spring Harbor, NY, 1989). Cosmid pFI 793 DNA was 
linearised, denatured and labelled with digoxygenin prior to probing the 
Southern blotted DNA according to the instructions supplied by 
Boehringer (Lewes, Sussex, UK). The pFI 793 probe hybridised to all of 
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the EcoRI/PstI restriction fragments of pFI 795 indicating that these two 
clones are identical. Excluding the vector band at least six EcoRI/PstI 
fragments appeared to be common to pFI 793, pFI 794, pFI 795 and pFI 
796. These were fragments of 6.6kb, 2.9kb, 1.8kb, 1.4kb, 1.25kb and 
5 LI kb. The L25kb fragment appeared to be a doublet or triplet. The 
DNA fragment patterns of the four cosmid clones after restriction 
digestion with EcoRI/PstI are shown in Figure 9. For clarity the ethidium 
bromide stained DNA bands have been diagrammatically superimposed on 
the original agarose gel. 

to 

As described above in Example 4 protein analysis of Ps. fluorescens 
biovar. V, strain AN 103 showed that cells grown on mws-ferulate 
contained much larger amounts of a protein of about 31 kD compared with 
cells grown on vanillic acid. The twenty N -terminal amino acids of this 

15 protein were sequenced. This amino acid sequence was then used to 
design degenerate oligonucleotide primers that enabled the 60 bp sequence 
of DNA coding for this N-terminus to be amplified from pFI 793 by PCR 
(Figure 10). This sequence was used to probe EcoRI/PstI digests of the 
cosmid clones. By this technique the fragment containing the region of 

20 DNA encoding the 31 kD protein in each of the cosmid clones could be 
identified (Figure 11). This proved to be a fragment of 6kb in cosmids 
pFI 793 and pFI 795, a fragment of 4.3kb in cosmid pFI 794 and a 
fragment of 5.5kb in cosmid pFI 796. 

25 The 4.3 Kb EcoRI/PstI fragment of pFI 794 was sub-cloned into the E.coli 
vector pUC19 using strain XLI(Blue) and its nucleotide sequence was 
determined using an Applied Biosystems DNA Sequencer (Model 373; 
Perkin Elmer, Warrington, UK); together with the manufacturer s Taq 
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DyeDeoxy Terminator Cycle sequencing kit. A primer walking strategy 
was used with oligonucleotide primers being synthesized on an ABI 392 
Synthesizer (Perkin Elmer, Warrington, UK). The DNA sequence of the 
4.3kb fragment is presented in Figure 12, The open reading frame 

5 encoding the same amino-terminus as determined previously starts at 
position 2872 and is terminated by a stop codon at position 3700. This 
ORF of 828 bp encodes a protein 276 amino acids long with a molecular 
size of 31.010 kD in good agreement with the protein gel analysis. The 
translated amino- acid sequence of this gene is also presented in Figure 12. 

io In order to confirm the function of this gene it was sub-cloned and 
expressed in £. coli. From the DNA sequence PCR primers were 
designed to amplify the gene such that it was flanked by restriction 
endonuclease sites EcoRI and BamHl. The amplified gene retained its 
native ribosome binding site being initiated at base -29 and ending 6bp 

15 downstream of the stop codon. The amplified fragment was cloned into 
the equivalent sites of the £. coli expression vector pSP72 (Promega, 
Southampton, UK) and transformed into E. coli JM109(DE3). 

The E. coli 803 clones carrying the hydratase/cleavage enzyme gene, plus 
20 a putative promoter, as a PCR product in the vector pRK415 were used 
in a triparental patch mating experiment essentially as described earlier in 
relation to complementation by the cosmid clones pFI 793-6. The 
complemented van 10 strain was demonstrated to have recovered the 
ability to grow on fr<ms-ferulic acid, confirming directly that the mutation 
25 in van 10 resided in the gene encoding the t rans- feruloy] SCoA 
hydratase/cleavage enzyme. 
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The presence of a novel enzyme activity (c/. Example 4) in the £*. coli 
clone was demonstrated. E. coli cells were grown at 37°C for 3 h in 
50ml of L medium, containing ampicillin (50 /ag/ml) with and without 
induction by IPTG. Extracts were prepared as described above in 

5 Example 2 for Ps. fluoresceins AN 103, but without centrifiigation. The 
crude extract was used for assay. Enzyme activity with both HMPHP 
SCoA and mz/15-feruloyl ScoA was determined as described above in 
Example 4 using HPLC to determine the reaction products. The results 
presented in Table IX clearly demonstrate that vanillin and acetyl SCoA 

10 were produced in equimolar proportions both with rrans-feruloyl SCoA 
and with HMPHP SCoA as substrates. In addition, HMPHP SCoA was 
also dehydrated to feruloyl SCoA, putatively rrans-feruioyl SCoA, 
demonstrating the reverse of activity II. These results are closely similar 
to those obtained with the vanillin-forming cleavage enzyme purified from 

15 Pseudomonas fluorescens AN 103, although the ratio of activities with 
trans - feruloyl SCoA and HMPHP SCoA differs slightly between the two 
preparations. There was no activity with either rnms-feruloyi SCoA or 
HMPHP SCoA in extracts of an unmanipulated E. coli strain, whether 
induced or not. In the manipulated strain E. coli 1039 that expresses the 

20 Pseudomonas gene the specific activity was slightly lower in the extract 
made following induction than in that made from uninduced bacteria. 
Since the assay measures only active enzyme it is conceivable that 
increased protein expression occurs upon induction but this may result in 
incorrectly folded ana therefore inactive enzyme. It was not possible to 

25 detect expression of the 3 lkD protein visually on Coomassie-stained, one- 
dimensional SDS gels because of its co-migration with the strongly 
expressed jS-lactamase encoded by the vector ampicillin resistance marker. 
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The DNA downstream of the gene encoding the 31kD protein was targeted 
for cloning and sequencing and for analysis of additional open reading 
frames. A PCR-generated probe was used to identify an overlapping Xhol 
fragment of 1.5 kb. Sequencing from this fragment and subsequently 
5 directly from the parent cosmid clone pFI794 revealed a second open 
reading frame of 1449 bp beginning at base 3804. The translation of this 
nucleotide sequence revealed a polypeptide of 483 amino acids. 
Comparison with sequences in the databases revealed appreciable 
homology to salicylaldehyde: NAD + oxidoreductase. 

10 

In order to confirm the function of this gene, expression was determined 
in E. coli strain DH5, which contained the vector pUC18 into which the 
full-length open reading frame had been inserted such that expression was 
from the lac promoter on the vector. Vanillin: NAD + oxidoreductase 
15 activity was confirmed and was absent from a control strain bearing the 
unmodified pUC 18 vector. Using the enzyme assay described in 
Example 2, activity with vanillin as substrate was determined as 3.0 
nkat/mg of protein; activity with salicylaldehyde was 2.8 nkat/mg. 

20 Additional sequence analysis of DNA cloned from Ps. fluorescens AN 103 
was undertaken using cosmid clone pFI 793. The 1.8, 0.9 and 0.8 kb 
EcoRI/PstI fragments were sub-cloned into £. coli vector pUCl8 and their 
nucleotide sequences were determined. Sequencing the 0.9 kb sub-clones 
revealed that there are two different fragments of the same size. The 

25 nucleotide sequences of DNA fragments of ISSTbp, 960bp, 959bp and 
854bp in sub-clones pFI 901, pFI 912, pFI 913 and pFI 911 respectively 
are presented in Figures 13 to 16. Outward reading PCR primers were 
designed from the ends of each of the four sequences as shown in Figure 
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17. Use of these primers in all possible pairwise combination with pFI 
793 as template showed that the 1 .8 kb fragment of pFI 901 was separated 
from the 959bp fragment of pFI 913 by 1.5 kb on the cosmid DNA 
(Figure 18). Direct sequence analysis of this 1 .5 kb PCR product enabled 
5 this together with the 1.8 kb and 959bp fragments to be merged into one 
larger fragment of 4.3 kb (Figure 19). 

Example 6: Production of vanillin from /raws-ferulovl SCoA and 
enzyme activities II and III 

Trans- feruloyl SCoA was synthesised as described in Example 2, and was 
used as a substrate of the frans-feruloyl SCoA hydratase/aldol cleavage 
enzyme (ie a single polypeptide with enzyme activities II and III) as 
purified by the method described in Example 4. Vanillin was produced 
from trans- feruloyl SCoA. 

Example 7: A transgenic tobacco plant which produces vanillin 

Nicotiana tabacum (tobacco) is transformed using a strain of 
20 Agrobacterium tumefaciens which has been modified so mat it transfers the 
Ps. fluorescens gene encoding enzyme activities II and III (see Example 
5) to the tobacco plant. The tobacco plant produces vanillin in those parts 
of the plant which have rrans-feruloyl SCoA present, at least in the form 
of a vanillin glycoside. 

25 



10 



15 
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CLAIMS 

1 . A method of producing vanillin comprising the steps of 

5 (1) providing /ra/u-ferulic acid or a salt thereof; and 

(2) providing frans-ferulate:CoASH ligase activity (enzyme 
activity I), /ra/w-feruloyi SCoA hydratase activity (enzyme 
activity II), and 4-hydroxyO-methoxyphenyl-/3-hydroxy- 
propionyl-S-CoA (HMPHPSCoA) cleavage activity (enzyme 

10 activity III). 

2. A method according to Claim 1 wherein means for converting 
vanillin to a non-vanillin product is absent or reduced. 

15 3. A method according to Claim 1 wherein the enzyme activities I, II 
and III are provided by Pseudomonas fluorescens biovar. V, strain 
AN 103 as deposited under the Budapest Treaty at the National 
Collections of Industrial and Marine Bacteria Limited, Scotland 
under Accession No NCIMB 40783, or a mutant or variant thereof. 

20 

4. A method according to any one of Claims 1 to 3 wherein the 
enzyme activities I, II and III are provided by an intact-cell-free 
system of Pseudomonas fluorescens biovar. V, strain AN 103 as 
deposited under the Budapest Treaty at the National Collections of 
25 Industrial and Marine Bacteria Limited, Scotland under Accession 

No NCIMB 40783, or a mutant or variant thereof. 
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5. A method according to Claim 2 wherein means for converting 
vanillin to a non-vanillin product is an activity that interconverts 
vanillin and vanillic acid (enzyme activity IV). 



5 6. A method according to Claim 5 wherein NAD + is absent. 

7. A method according to Claim 1 or 2 further comprising the step of 

(3) providing any one of the cofactors Coenzyme ASH, ATP or 
io Mg 2+ , or other functionally equivalent cofactors. 

8. A method according to Claim 7 wherein either one of the cofactors 
Coenzyme ASH and ATP is recycled or generated. 

15 9. A method according to Claim 8 wherein Coenzyme ASH is 
recycled using the enzymes citrate synthase and citrate lyase. 

10. A method according to Claim 8 wherein ATP is generated using 
the enzymes- adenylate kinase and acetate kinase. 

20 

11. A method according to any one of Claims 1 to 10 wherein the 
mz/zs-ferulic acid or salt thereof is provided by action of trans- 
ferulic acid esterase on plant material, said plant material 
containing an ester of rra/w-ferulic acid. 

25 

12. A method according to any one of Claims 1 to 1 1 wherein at least 
one of the enzyme activities II or III is provided by a substantially 
purified enzyme. 
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13. A method according to any one of Claims 1 to 12 further 
comprising the step of providing a compound other than trans- 
ferulic acid or a salt thereof which may be converted by any one of 
enzyme activities I, II or III into a desirable product. 

5 

14. A method according to Claim 13 wherein said compound is any of 
rrans-4-coumaric acid or a salt thereof, rrans-4-coumaroyl S CoA, 
rram-caffeic acid or a salt thereof, rra/w-caffeoyl SCoA or 3,4- 
methylene dioxy-zrans-cinnamic acid or a salt thereof. 

10 

15. A method according to Claim 14 wherein said compound is trans-A- 
coumaric acid or a salt thereof or rrans~4-coumaroyl SCoA. 



16. A method of producing vanillic acid, or a salt thereof, comprising 
15 the steps as defined in any one of Claims 1, 3, 4 and 7 to 12 and 

the further step of providing enzyme activity IV. 



17. A method according to any one of Claims 1 to 12 comprising the 
further step of separating vanillin from the other reaction 
20 components. 



18. A method according to Claim 16 comprising the further step of 
separating vanillic acid, or a salt thereof, from the other reaction 
components. 

25 

19. Pseudomonas fluorescens biovar V, strain AN 103 as deposited 
under the Budapest Treaty at the National Collections of Industrial 
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and Marine Bacteria Limited, Scotland under Accession No 
NCIMB 40783, or a mutant or variant thereof. 

20. A mutant of Pseudomonas fluorescens biovar V, strain AN 103 
5 according to Claim 18 which accumulates vanillin when provided 

with ms/is-ferulic acid or a salt thereof. 

21. A polypeptide which, in the presence of appropriate cofactors if 
any, is capable of catalysing the interconversion of rrows-feruloyl 

10 S CoA and 4-hydroxy-3-methoxypheny 1-0-hydroxy-propionyl SCo A 

(HMPHP SCoA). 

22. A polypeptide according to Claim 21 which is a frans-feruloyl S 
CoA hydratase. 



15 



20 



23. A polypeptide which, in the presence of appropriate cofactors if 
any, is capable of catalysing the interconversion of 4-hydroxy-3- 
methoxypheny!-/3-hydroxy-propionyl SCoA (HMPHP SCoA) and 
vanillin. 

24. A polypeptide according to Claim 23 which is a HMPHP SCoA 
cleavage enzyme. 



25. A polypeptide which, in the presence of appropriate cofactors if 
25 any, is capable of catalysing the interconversion of rrans-feruloyl- 

S-CoA and vanillin via HMPHP SCoA. 
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26. A polypeptide according to any one of Claims 21, 23 and 25 
comprising the amino acid sequence 



MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 
5 PheVallleLeuAsnArgProGluLysArgAsnAlaMetSerProThrLeuAsnArgGlu 
MetlleAspValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuThr 
GlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheArgGluValAsp 
AlaGlyProGIuIleljeuGlnGluLysIleArgArgGluAlaSerGlnTrpGlnTrpLys 
LeuLeuArgMetTyrAlaLysProThrlleAlaMetValAsnGlyTrpCysPheGlyGly 

10 GlyPheSerProLeuValAlaCysAspLeuAlalleCysAlaAspGluAlaThrPheGly 
LeuSerGluIleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAsp 
ThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrPheGlyGlyGln 
LysAlaAlaGluMetGlyLeuValAsnGluSerValProLeuAlaGlnlieuArgGluVal 
Thr 1 1 eG luLeuAl aAr gAsnLeuLeuG 1 uhy s AsnProVa 1 Va 1 LeuArgAl aAl a Ly s 

15 HisGlyPheLysArgCysArgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAla 
LysLeuAspGlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMetLysGln 
PheLeuAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArg 



(SEQ ID No 2) or a fragment or variant thereof. 

20 

27. A vanillin: NAD + oxidoreductase comprising the amino acid 
sequence 

MetLeuAspValProLeuLeuIleGlyGlyGlnSerCysProAlaArgAspGlyArgThr 
PheGluArgArgAsnProValThrGlyGluLeuValSerArgValAlaAlaAlaThrLeu 
25 GluAspAlaAspAlaAlaValAlaAlaAlaGlnGlnAlaPheProAlaTrpAlaALaLeu 
AlaProAsnGluArgArgSerArgLeuLeuLysAlaAlaGluGlnLeuGlnAlaArgSer 
GlyGluPhelleGluAlaAlaGlyGluThrGlyAlaMetAlaAsnTrpTyrGlyPheAsn 
ValArgLeuAlaAlaAsnMetLeuArgGluAlaAlaSerMetThrThrGlnValAsnGly 
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GluVallleProSerAspValProGlySerPheAlaMetAlaLeuArgGlnProCysGly 
ValValLeuGlylleAlaProTrpAsnAlaProVallleLeuAlaThrArgAlalleAla 
MetProLeuAlaCysGlyAsnThrValValLeuLysAlaSerGluLeuSerProAlaVal 
HisArgLeuIleGlyGlnValLeuGlnAspAlaGlyLeuGlyAspGlyValValAsnVal 
IleSerAsnAlaProAlaAspAlaAlaGlnlleValGluArgLeuIleAlaAsnProAla 
ValArgArgValAsnPheThrGlySerThrHisValGlyArglleValGlyGliiLeuSer 
AlaArgHisLeuLysProAlaLeuLeuGlul#euGlyGlyLysAlaProLeuLeuValLeu 
AspAspAlaAspLeuGluAlaAlaValGlnAlaAlaAlaPheGlyAlaTyrPheAsnGln 
GlyGlnlleCysMetSerThrGluArgLeuIleValAspAlaLiysValAlaAspAlaPhe 
ValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspProAlaAspProGlu 
SerValLeuGlySerljeuValAspAlaSerAlaGlyThrArglleLysAlaLeuIleAsp 
AspAlaValAlaLysGlyAlaArgLeuVallleGlyGlyGlnLeuGluGlySerlleLeu 
GlnProThrLeuLeuAspGlyValAspAlaSerMetArgLeuTyrArgGluGluSerPhe 
GlyProValAlaValValLeuArgGlyGluGlyGluGluAlaLeuLeuGlnl^euAlaAsn 
AspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGlyArgAlaLeuAla 
LeuAlaGlnArgValGluSerGlylleCysHisIleAsnGlyProThrValHisAspGlu 
AlaGlnMetProPheGlyGlyValLysSerSerGlyTyrGlySerPheGlyGlyLysAla 
SerlleGluHisPheThrGlnLeuArgTrpValThrLeuGinAsnGlyProArgHisTyr 
Prolle 

(SEQ ID No 4) or a fragment or variant thereof. 

A polypeptide as defined in any one of Claims 21 to 27 which is 
substantially pure. 



A polynucleotide encoding a polypeptide as defined in any one of 
Claims 21 to 27. 
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A polynucleotide comprising at least a part of the Pseudomonas 
fluoresceins DNA contained within the cosmid clone pFI 793 as 
deposited under the Budapest Treaty at the National Collections of 
Industrial and Marine Bacteria Limited. Scotland under Accession 
No NCIMB 40777, or a fragment or variant thereof. 

Cosmid pFI 793 as deposited under the Budapest Treaty at the 
National Collections of Industrial and Marine Bacteria Limited, 
Scotland under Accession No NCIMB 40779. 

A polynucleotide according to Claim 30 encoding a polypeptide as 
defined in any one of Claims 21 to 27. 

A polynucleotide according to any one of Claims 29, 30 or 32 
comprising the nucleotide sequence 

ATGAGCACATACGAAGGTCGCTGGAAAACGGTCAAGGTCGAAATCGAAGACGGCATCGCG 

TTTGTCATCCTCAATCGCCCGGAAAAACGCAACGCGATGAGCCCGACCCTGAACCGCGAG 

ATGATCGATGTTCTGGAAACCCTCGAGCAGGACCCTGCCGCCGGTGTGCTGGTGCTGACC 

GGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCCGCGAAGTGGAC 

GCCGGCCCGGAAATCCTCCAGGAAAAAATCCGCCGCGAAGCCTCGCAATGGCAATGGAAA 

CTGCTGCGCATGTACGCCAAGCCGACCATCGCCATGGTCAATGGCTGGTGCTTCGGCGGC 

GGTTTCAGCCCGCTGGTGGCCTGCGACCTGGCGATCTGCGCCGACGAAGCAACCTTCGGT 

CTCTCGGAAATCAACTGGGGTATCCCGCCGGGCAACCTGGTGAGCAAGGCCATGGCCGAC 

ACCGTGGGCCACCGCCAGTCGCTCTACTACATCATGACCGGCAAGACCTTCGGTGGGCAG 

AAAGCCGCCGAGATGGGCCTGGTCAACGAAAGCGTGCCCCTGGCGCAACTGCGCGAAGTC 

ACCATCGAGCTGGCGCGTAACCTGCTCGAAAAAAACCCGGTGGTGCTGCGTGCCGCCAAA 

CACGGTTTCAAACGCTGCCGCGAACTGACCTGGGAGCAGAACGAGGATTACCTGTACGCC 

AAGCTCGATCAGTCGCGTTTGCTGGACACCGAAGGCGGTCGCGAGCAGGGCATGAAGCAA 
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TTCCTCGACGACAAGAGCATCAAGCCTGGCCTGCAAGCGTATAAACGC 

(SEQ ID No 1) or a fragment or variant thereof. 

5 34. A polynucleotide according to any one of Claims 29, 30 or 32 
comprising the nucleotide sequence 

ATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGTCGTGCCCCGCGCGCGACGGTCGAACC 
TTCGAGCGCCGCAACCCGGTGACTGGCGAGTTGGTGTCGCGGGTTGCCGCCGCCACCCTG 
GAAGATGCCGACGCCGCCGTGGCCGCTGCCCAGCAAGCGTTTCCCGCGTGGGCCGCGCTG 

1 0 GCGCCCAATGAACGGCGCAGCCGTTTGCTCAAGGCCGCCGAACAATTGCAGGCGCGCAGC 
GGCGAGTTCATCGAGGCGGCGGGCGAGACCGGCGCCATGGCCAACTGGTACGGGTTCAAC 
GTACGGCTGGCGGCCAACATGCTGCGTGAAGCGGCATCGATGACCACCCAGGTCAATGGT 
GAAGTGATTCCCTCGGACGTTCCCGGCAGTTTCGCCATGGCCCTGCGCCAGCCCTGTGGC 
GTGGTGCTGGGCATCGCCCCCTGGAACGCCCCGGTGATTCTCGCCACCCGGGCGATTGCC 

15 ATGCCGCTGGCCTGTGGCAACACCGTGGTGCTGAAGGCTTCCGAGCTGAGTCCGGCGGTG 
CATCGCTTGATCGGCCAGGTGCTGCAGGACGCCGGCCTGGGCGATGGCGTGGTCAACGTC 
ATCAGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGATTGCCAACCCGGCC 
GTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCGGGCGCATTGTCGGCGAGCTCTCG 
GCGCGCCACCTCAAACCGGCGTTGCTCGAGCTGGGCGGCAAGGCACCGTTGCTGGTGCTC 

20 GACGATGCCGACCTGGAGGCTGCCGTGCAGGCGGCGGCGTTTGGCGCCTACTTCAACCAG 
GGACAGATCTGTATGTCCACCGAGCGCCTGATTGTCGATGCCAAGGTGGCCGACGCCTTT 
GTCGCCCAGTTGGCGGCCAAGGTCGAGACCCTGCGCGCCGGTGATCCTGCCGACCCGGAG 
TCGGTGCTCGGTTCGCTGGTGGACGCCAGCGCTGGCACGCGGATCAAAGCGTTGATCGAT 
GATGCCGTGGCCAAGGGCGCGCGCCTGGTAATCGGCGGGCA^.CTGGAGGGCAGCATCTTG 

25 CAGCCGACCCTGCTCGACGGTGTCGACGCGAGCATGCGTTTGTACCGCGAAGAGTCCTTC 
GGCCCGGTGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCIXjITGCAACTGGCCA^ 
GACTCCGAGTTCGGTTTGTCGGCGGCGATTTTCAGTCGTGACACCGGCCGTGCCCTGGCC 
CTGGCCCAGCGGGTCGAATCGGGCATCTGCCACATCAACGGCCCGACCGTGCACGACGAA 
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GCGCAAATGCCTTTTGGCGGGGTCAAGTCCAGCGGCTACGGCAGTTTTGGCGGCAAGGCA 
TCGATTGAGCATTTCACTCAGTTGCGCTGGGTCACCCTCCAGAATGGTCCACGGCACTAT 
CCGATC 

5 (SEQ ID No 3) or a fragment or variant thereof. 

35. A nucleic acid vector comprising a polynucleotide according to any 
one of Claims 28 to 34. 

10 36. A host cell comprising a polynucleotide as defined in any one of 
Claims 28 to 34. 

37. A host cell according to Claim 36 comprising nucleic acid which 
encodes any one of the polypeptides as defined in Claims 21, 23 

15 and 27. 

38. A host cell according to Claim 36 or 37 which is a bacterium or 
yeast. 

20 39. A host cell according to Claim 38 which is a food-grade bacterium 
or yeast. 

40. A host cell according to Claim 39 which is a Lactococcus spp. or 
a Lactobacillus spp. or Saccharomyces cerevisiae or a biovar 

25 thereof. 

41. A host cell according to Claim 37 which is a plant cell. 
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42. A host cell according to Claim 41 wherein said plant cell is a cell 
from any one of Nicotiana spp. , Solatium tuberosum, Brassica spp. , 
Beta spp.. Capsicum spp. and Vanilla spp. 

5 43. A host cell according to Claim 41 or 42 wherein said cell is 
comprised in a plant. 

44. A transgenic plant comprising at least one polynucleotide according 
to Claim 29 and which, as a consequence of the presence of said 

10 polynucleotide, expresses any of the enzyme activities II and III. 

45. A transgenic plant according to Claim 45 which, as a consequence 
of the presence of said polynucleotides, expresses the enzyme 
activities II and III. 

15 

46. A transgenic plant according to Claim 44 or 45 wherein said plant 
is selected from Nicotiana spp. , Solanum tuberosum, Brassica spp. , 
Beta spp., Capsicum spp. and Vanilla spp. 

20 47. A method according to Claims 1 to 18 wherein the enzyme 
activities II and III are provided by the host cell according to any 
of Claims 36 to 43 or a transgenic plant according to any one of 
Claims 44 to 46, or an extract thereof. 

25 48. A method according to Claims 1 to 18 wherein the enzyme 
activities I, II and III are provided by a microorganism which can 
convert fra/w-ferulic acid to vanillin. 
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49. Use of Pseudomonas fluorescens biovar. V, strain AN103 or a 
mutant or derivative thereof for producing vanillin, or vanillic acid 
or salt thereof. 



5 50. Use of a polypeptide according to any one of Claims 21 to 28 for 
producing vanillin, or vanillic acid or salt thereof. 



51. Use of a polynucleotide according to any one of Claims 29 to 35 
for producing vanillin, or vanillic acid or salt thereof. 

10 

52. Use of a host cell according to any one of Claims 36 to 43 for 
producing vanillin, or vanillic acid or salt thereof. 



53. Use of a transgenic plant according to any one of Claims 44 to 46 
15 for producing vanillin, or vanillic acid or salt thereof. 

54. Use of Pseudomonas fluorescens biovar V, strain AN 103 or a 
mutant or derivative thereof for converting a compound, other than 
fraws-ferulic acid or a salt thereof, into a desirable product. 

20 

55. Use of a polypeptide according to any one of Claims 21 to 28 for 
converting a compound into a desirable product. 

56. Use of a polynucleotide according to any one of Claims 29 to 35 
25 for converting a compound into a desirable product. 

57. Use of a host cell according to any one of Claims 36 to 43 for 
converting a compound into a desirable product. 
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58. Use of a transgenic plant according to any one of Claims 44 to 46 
for converting a compound, other than /rans-ferulic acid or a salt 
thereof, into a desirable product. 

5 59. Use according to any one of Claims 54 to 58 wherein said desirable 
product is a flavour or aroma. 

60. Use according to Claims 54 to 59 wherein said compound is any 
one of rra/25-4-coumaroyl S Co A and rrans-caffeoyl SCoA. 

10 

61 . A food or beverage comprising a host cell according to any one of 
Claims 36 to 43, or an extract thereof. 

62. A food or beverage comprising a transgenic plant according to 
15 Claim 44 to 46, or a part or extract thereof. 

63. A method of producing vanillin comprising 

(1) providing /ratts-feruloyl SCoA; and 

20 

(2) providing rra/is-feruloyl SCoA hydratase activity (enzyme 
activity II), and HMPHP SCoA clevage activity (enzyme 
activity III). 

25 64. Any novel method of producing vanillin or vanillic acid 
substantially as disclosed herein. 

65. Any novel polypeptide substantially as disclosed herein. 
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66. Any novel polynucleotide substantially as disclosed herein. 
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Figure 7 
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Fig 8. 




97.4 kDa Phosphorylase b 
66.2 kDa Bovine serum albumin 
45.5. kDa Hen egg white ovalbumin 
3 1 .0 kDa Bovine carbonic anhyclrase 

2 1 .5 kDa Soybean trypsin inhibitor 
1 4.4. kDa Hen egg white lysozyme 
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Nucleotide ar>d amino acid sequence of the cleavage en zyme and vanillin dehydrogenase, 

Sequence determined from sub-cloned fragments pFI989 and pFI1056 and from the cosmid 

clone pFT794 

1 GAATTCTCTCGCGCTTTGCCCAGTCCTACCCGCTGGTGCAGATCGAGGTG 5 0 

5 1 CATTGCGAGTCGTCCAAGCAACTTCTCTTGCGCCAGGACCTCGACCTGTC 10 0 

101 C ATCGTG ACCCGCG AACCCGGC AACG AAATCGGCC AGCTGTTGCGC AAGG 150 

151 AGCGTTTTGTCTGGGCCCAGGCCCAGTGCTACAACCCTGTCGAGCAATCA 200 

201 CCCTTGCCGCTGGCGATGTTCAACAGTGACTGCTTCTGCCGTCTTTGGGC 250 

2 51 CTGTAATGCGCTGGATGCCGCCGGACGTGAATACCGCATCGCCT ACAAC A 300 

3 01 GTTCGAGCCTGTCGGCGCTGATGGCAGTGGTCAGCGCCGGGCTGGCGGTG 350 

3 51 ACCGCTCAATTGGAAAGCCTGATCCCGCAGGACATGCGCATCCTCGGCGA 400 

4 01 GGCCGAAGGCCTGCCCCAACTGCCCGAGGCGAGCATCATGCTGCTGCGCA 4 50 
4 51 ATCTGC ATAATCCGTCGCCG ATT ACCGAATGTCTGGCGGAGC AC ATCGTC 500 
501 GAAGGCTTC AAACTTTAAAGGCG AG CAT C AC CGCGC AG A GCACC AG AAAA 550 
551 CCGC AGAAC AAACCGCGT AGT AGCCGCTCCGGCAGGGCGTGGGC AATCTT 600 
601 CACGCCCCAACTGATACTG AGCAGACCGCCC ACCGCCATGGGCAACGCGA 650 
651 TGTGCC AGTCGACTTGCTGGTGC AGGGCGT AGGTCGCC AGGGTTAC ACCG 7 00 
7 01 GTGCTGGGTAAAGCCAGTGCCAACGACAGGCCCTGGGCCACCACCTGGGT 7 50 
7 51 GGTGCCAAACAAGCTGGTCAATACCGGCGTTGCGACCAC AGCGCCGCCCA 800 
801 CCCCGAACAAGCC ACCCATG ACGCCGGACGCCGCGCCCAGC ACCCCGAGC 8 50 
851 C AGGGCCACGAATAGCGC ATCTGCGC AGTCGGTGCCGCCGC AGTCATGAA 900 
901 C ATGCGCATCAGGTTGTAGACCGACAAGGCC ACCAGAAACACCACGAACC 9 50 
9 51 CGATGCGCATCACCTG AGCGTCGATCCCCACCGCCCAGATCGAACCGAGC 1 0 0 0 

10 01 C AGGCAAAGC AGAACCCC ATGG AGGCC AACGGCAGCGCGTGGCGCAACTC 1050 

10 51 G ATGCGATTACGTTGGTGATAACGCCATAACGCCAGCATCACGTTCGGCA 1100 

1101 CCACCATGACC AG AGCTGTGCCCTGGGCAAGCTGCTGATCC AGGCC AAAT 1150 

1151 A AC ACGCCCAGAGCGGG AATGGCGATC AAGCCACCGCCGATTCCAAACAA 12 00 

1201 ACCACCTACGGTCCCCAAGGCTGC ACCG AGCAGCAGGT AC ATCGTC AACT 12 50 

12 51 C AATCACAGGTCAA ATTCCCTC ACGTCAATGGGTGCATCCTACGCAGTCG 13 00 

Figure 12 (Part 1 of 5) 
SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9735999A2J_> 



WO 97/35999 PCT/GB97/00809 



13/27 

13 01 GGGCTAGCGGGGAAACGC AC AGCAACGCAC AATGGCTATGCC AAATTCGC 13 50 

13 51 ACAAGCAATCCTCCATGAACCCCACAACGCTCACCGATCAATTGGGTCTA 14 0 0 

14 01 TTCCTTG ATGTCGTGGAAACCGGC AGTTTTTCCGCTGCGTCCCG ACGCCA 14 50 

14 51 TCCGCTGACACCCTCCGCCGTCGCCAGGCGTATCGATAGTCTGGAACAGG 1500 
1501 CGCTCGACAGCC AATTGTTCGTGCGC ACC ACTC ATGCTGTGCGTCCTACG 1550 

15 51 CCAGCGGGATTGGCGTTTGCCGAGCGAGCCCGACGCATTGTCGGCGAGTT 16 0 0 
1601 GCGCCTGGCGCGGGCCG AGGTCGCCTCCCTG AGCAGCGCGCCTGAAGG AC 1650 
1651 TG ATTCGGGTCG ACGCCCCCGC AGCCTTCGGCCGC AGGC ACCTGGCGCCG 17 00 
1701 GTGATCTATGACT ACG ACTTCGCCGCCTCCGGCGTGCCCGGGCTGCGGGG 17 50 
17 51 CCGGTTGCGCTACCTGCGCGGCGAC AAC ATCGAGTTGAAAGCCTTC AACG 1800 
1801 CCGAAGACCGCAAGG AGCGCGAGTTCCAGATGGAGCTGGGCTACGTGGTG 18 50 
1851 C AAAGCGGTCCGCTGAAAAACGTCGGCCTGGTGGCGCGCAAGGCAATCTA 1900 
1901 CCGCAATGACTTCCCCACTGGCGCCGCCTTCCGCGATGAAAACCAGACGC 19 50 
1951 GGTTTCTGGTGACCTATACCTTGCCGATCTGGTGAGTGCGCGTGTTGCGG 2 000 
2 001 TGGGGCTGATGGCCCCATCGCGAGCGGGCTCGCTCCTACAGTGGGTTTGG 2050 
2 051 TGTTAATCACAGAGGCTGTGGAGCTTGCAGCCCCTGTAGGCGCTGGCTTG 2100 
2101 CCAGCGAGGCGTAGGCACTGCTGGCGCAAGGCTCAAGGCCCCACAGGCCC 2150 
2151 GCTCCC ACCCTTCAGATTTTCTATTCCTGATAAATCTTCTTCAGCAGCCG 2200 
22 01 CAGCAGCTCGTCGCGTTCCTGGTCGTCC AGTGCCGAGGTGGCGTCGAGGT 2 2 50 
22 51 CGCTTTGGGCGGCG ATCTGGTTCAGTTCCTTGAGCAGGGTCTCGCCGGTC 2 3 00 
2 3 01 TTGCTG AGG AATATCCCGT ACG AGCGCTTGTCCGGCTTGC AGCGC ACACG 2 3 50 
2 3 51 CACCGCCAGCGCCCGGCTTTCCAGCTTATTCAGCAGCGGTACC ACCTGGG 2 4 00 
2 4 01 GCGGCTCGATGCTCAGAGCCCGGGCCAGGTCGGCCTGCATCAGGCCGGGG 2 4 50 
2 4 51 TTCTGATTGATG ATCGCCAGCGCCG AGAATTGCGCGGGGCGCAGATCGTG 2 50 0 
2 501 GGCCGAGAGGCGGCTG ATC AGGTTCTGGAACAGTTTCAGTTGCGC ACGGC 2 5 5 0 
2 5 51 GC ATGGCGTAGCCGATCAGATCATTCAGCGCCGAATCCATGGGCGCCTGG 2 6 0 0 
2 6 01 GTCTCGGCGGGAGTCGACGC AGCCTCGACCGACTCGGCGAGGGGGGAGGG 2 65 0 
2 6 5 i CTTGGCC ATTGCGGGGAAGTCCTG AAGATGGAGGTT AACAAGACTATCTA 17 0 0 
2 -> 0 1 GTTTGCCGACCTTGGCCGGTGATTGCTACGGCCAATATCGCTCGGCGCCA 2 7b'.- 
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27 51 AGACCGACCAGTCCATCACCTGCGAGAAAATTGGTTAAATCAATTAATAG 2800 

2 8 01 TTAATTGACATAACTAATTCGCTGCTTTAATTTCGAGTC ATCTTCAAAAC 2850 

2 8 51 CCAGAACAAGAGAGCATCGCCATGAGCACATACGAAGGTCGCTGGAAAAC 2 900 

MetSerThrTyrGluGlyArgTrpLysTh 

2 901 GGTCAAGGTCGAAATCGAAGACGGCATCGCGTTTGTCATCCTCAATCGCC 2 9 50 

rValLysValGluIleGluAspGlylleAlaPheVallleLeuAsnArgP 

29 51 CGG AAAAACGC AACGCG ATG AGCCCG ACCCTGAACCGCG AG ATGATCG AT 3000 
ToGluLysArgAsnAlaMetSerProThrLeuAsnArgGluMetlleAsp 

3 001 GTTCTGGAAACCCTCGAGC AGGACCCTGCCGCCGGTGTGCTGGTGCTGAC 3 050 

ValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuTh 

3 051 CGGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCC 3100 
TGlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheA 

3101 GCGAAGTGGACGCCGGCCCGGAAATCCTCC AGGAAAAAATCCGCCGCGAA 3150 
rgGluValAspAlaGlyProGluIleLeuGlnGluLysIleArgArgGlu 

3151 GCCTCGC AATGGCAATGGAAACTGCTGCGC ATGTACGCCAAGCCGACCAT 3200 
AlaSerGlnTrpGlnTrpLysLeuLeuArgMetTyrAlaLysProThrll 

3201 CGCCATGGTCAATGGCTGGTGCTTCGGCGGCGGTTTCAGCCCGCTGGTGG 3250 
eAlaMetValAsnGlyTrpCysPheGlyGlyGlyPheSerProLeuValA 

32 51 CCTGCGACCTGGCGATCTGCGCCGACGAAGC AACCTTCGGTCTCTCGGAA 3300 
laCysAspLeuAlalleCysAlaAspGluAlaThrPheGlyLeuSerGlu 

3 3 01 ATC AACTGGGGTATCCCGCCGGGC AACCTGGTGAGCAAGGCCATGGCCGA 3 3 50 
IleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAs 

3 3 51 C ACCGTGGGCCACCGCCAGTCGCTCT ACTACATCATGACCGGC AAGACCT 34 00 
pThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrP 

3 401 TCGGTGGGC AGAAAGCCGCCGAGATGGGCCTGGTC AACGAAAGCGTGCCC 3 4 5 0 
heGlyGlyGlnLysAlaAlaGluMetGlyLeuValAsnGluSerValPro 

3 4 51 CTGGCGCAACTGCGCGAAGTCACCATCGAGCTGGCGCGTAACCTGCTCGA 3 5 00 
LeuAlaGlnLeuArgGluValThr I leGluLeuAlaArgAsnLeuLeuGl 

3 501 AAAAAACCCGGTGGTGCTGCGTGCCGCC AAACACGGTTTC AAACGCTGCC 3 5 5 0 
uLysAsnProValValLeuArgAlaAlaLysHisGlyPheLysArgCysA 

3 5 51 GCGAACTG ACCTGGG AGCAGAACGAGGATT ACCTGTACGCCAAGCTCGAT 3600 
rgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAlaLysLeuAsp 

3 6 0] CAGTCGCGTTTGCTGGACACCGAAGGCGGTCGCGAGC AGGGCATGAAGCA 3650 
GlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMetLysGl 

3 6 51 ATTCCTCG ACGACA AGAGCATC AAGCCTGGCCTGCAAGCGTATAAACGCT 37 00 
nPheLeuAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArgE 

3 7 0 1 GAAGG ACGACGCTGCGGGCGCATTGCGCGAAGGCGAGTGCGCCCTGAAGC 3 7 5 0 
nd 
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37 51 TGCGTTTACATCACTGCTAAGCATTCCGATAAAG ACGATAAAGAGGAATC 3800 

3 801 ACCATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGTCGTGCCCCGCGCG 3 8 50 
MetLeuAspValProLeuLeuIleGlyGlyGlnSerCysProAlaAr 

3 8 51 CGACGGTCG AACCTTCG AGCGCCGC AACCCGGTG ACTGGCG AGTTGGTGT 3 9 00 
gAspGlyArgThrPheGluArgArgAsnProValThrGlyGluLeuValS 

3 9 01 CGCGGGTTGCCGCCGCCACCCTGG AAG ATGCCG ACGCCGCCGTGGCCGCT 3 9 50 
erArgValAlaAlaAlaThrLeuGluAspAlaAspAlaAlaValAlaAla 

3 9 51 GCCC AGC AAGCGTTTCCCGCGTGGGCCGCGCTGGCGCCCAATG AACGGCG 4 000 

AlaGlnGlnAlaPheProAlaTrpAlaAlaLeuAlaProAsnGluArgAr 

4001 CAGCCGTTTGCTCAAGGCCGCCGAACAATTGCAGGCGCGCAGCGGCGAGT 4 0 50 
gSerArgLeuLeuLysAlaAlaGluGlnLeuGlnAlaArgSerGlyGluP 

4051 TC ATCG AGGCGGCGGGCGAGACCGGCGCC ATGGCC AACTGGT ACGGGTTC 4100 
helleGluAlaAlaGlyGluThrGlyAlaMetAlaAsnTrpTyrGlyPhe 

4101 AACGTACGGCTGGCGGCCAACATGCTGCGTGAAGCGGCATCGATGACCAC 415 0 
AsnValArgLeuAlaAlaAsnMetLeuArgGluAlaAlaSerMetThrTh 

4151 CCAGGTCAATGGTGAAGTGATTCCCTCGGACGTTCCCGGC AGTTTCGCC A 42 00 
rGlnValAsnGlyGluVallleProSerAspValProGlySerPheAlaM 

4 201 TGGCCCTGCGCC AGCCCTGTGGCGTGGTGCTGGGC ATCGCCCCCTGGAAC 4250 

etAlaLeuArgGlnProCysGlyValValLeuGlylleAlaProTrpAsn 

4251 GCCCCGGTGATTCTCGCC ACCCGGGCGATTGCC ATGCCGCTGGCCTGTGG 4 3 00 
AlaProVallleLeuAlaThrArgAlalleAlaMetProLeuAlaCysGl 

4 301 CAACACCGTGGTGCTGAAGGCTTCCGAGCTGAGTCCGGCGGTGCATCGCT 4 3 5 0 
yAsnThrValValLeuLysAlaSerGluLeuSerProAlaValHisArgL 

4 3 51 TGATCGGCCAGGTGCTGCAGGACGCCGGCCTGGGCGATGGCGTGGTCAAC 4 4 0 0 
eulleGlyGlnValLeuGlnAspAlaGlyLeuGlyAspGlyValValAsn 

4 401 GTCATC AGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGAT 4 4 5 0 
VallleSerAsnAlaProAlaAspAlaAlaGlnlleValGluArgLeuIl 

4 4 51 TGCCAACCCGGCCGTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCG 4 50 0 
GAlaAsnProAlaValArgArgValAsnPheThrGlySerThrHisValG 

4 501 GGCGCATTGTCGGCGAGCTCTCGGCGCGCCACCTC AAACCGGCGTTGCTC 4 5 50 
lyArglleValGlyGluLeuSerAlaArgHisLeuLysProAlaLeuLeu 

4 551 GAGCTGGGCGGCAAGGCACCGTTGCTGGTGCTCGACG ATGCCG ACCTGG A 4 60 0 
GluLeuGlyGlyLysAlaProLeuLeuValLeuAspAspALaAspLeuGl 

4 601 GGCTGCCGTGCAGGCGGCGGCGTTTGGCGCCTACTTCAACCAGGGACAGA 4 650 
uAlaAlaValGlnAlaAlaAlaPheGlyAlaTyrPheAsnGlnGlyGlnl 

4 6 51 TCTGTATGTCCACCGAGCGCCTGATTGTCGATGCC AAGGTGGCCGACGCC 4 7 0 n 
leCysMetSerThrGluArgLeuI leVa lAspAlaLysVal AlaAspA la 
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4 7 01 TTTGTCGCCC AGTTGGCGGCCAAGGTCGAGACCCTGCGCGCCGGTGATCC 47 50 
PheValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspPr 

4 7 51 TGCCGACCCGGAGTCGGTGCTCGGTTCGCTGGTGG ACGCC AGCGCTGGC A 4 800 
oAlaAspProGluSerValLeuGlySerLeuValAspAlaSerAlaGlyT 

4 801 CGCGG ATCAAAGCGTTGATCGATGATGCCGTGGCCAAGGGCGCGCGCCTG 4 850 
hrArglleLysAlaLeuIleAspAspAlaValAlaLysGlyAlaArgLeu 

4 8 51 GTAATCGGCGGGCAACTGGAGGGCAGCATCTTGCAGCCGACCCTGCTCGA 4900 
VallleGlyGlyGlnLeuGluGlySerlleLeuGlnProThrLeuLeuAs 

4 901 CGGTGTCGACGCGAGCATGCGTTTGTACCGCGAAGAGTCCTTCGGCCCGG 4 9 50 
pGlyValAspAlaSerMetArgLeuTyrArgGIuGluSerPheGlyProV 

4 951 TGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCTGTTGCAACTGGCC 5000 
alAlaValValLeuArgGlyGluGlyGluGluAlaLeuLeuGlnLeuAla 

5001 AACGACTCCG AGTTCGGTTTGTCGGCGGCG ATTTTC AGTCGTGAC ACCGG 5050 
AsnAspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGl 

5051 CCGTGCCCTGGCCCTGGCCCAGCGGGTCGAATCGGGC ATCTGCC ACATC A 5100 
yArgAlaLeuAlaLeuAlaGlnArgValGluSerGlylleCysHisIleA 

5101 ACGGCCCGACCGTGCACGACGAAGCGCAAATGCCTTTTGGCGGGGTCAAG 5150 
snGlyProThrValHisAspGluAlaGlnMetProPheGlyGlyValLys 

5151 TCCAGCGGCT ACGGC AGTTTTGGCGGCAAGGC ATCGATTG AGC ATTTCAC 52 00 
SerSerGlyTyrGlySerPheGlyGlyLysAlaSerlleGluHisPheTh 

52 01 TCAGTTGCGCTGGGTCACCCTCCAGAATGGTCC ACGGC ACTATCCGATCT 525 0 - 
rGlnLeuArgTrpValThrLeuGlnAsnGlyProArgHisTyrProIleE 

5251 GA 5252 
nd 
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Nucleotide sequence of pFI901 (1.8Kb EcoRI/Pstl fragment ex pFI973) 

1 GAATTCGGGA TCTGGGCTGC CAACCAGTTG GAAGAAAAGA TTCTCG AACT 

SI CGGTGTCGAC AACGTCGGCG CCTTCATTGC CGAGCCGATC CAGGGCGCCG 

101 GCGGCGTGAT CGTGCCGCCA GAAAGCTACT GGCCGCGCAT CAAGGAAATC 

151 CTCGCCAAGT ACGACATCCT GTTCGTCGCC GATGAAGTGA TTTGCGGTTT 

201 CGGCCGTACC GGCGAGTGGT TCGGCAGCGA TTTCTACGAC CTCAAGCCCG 

251 ACATGATGAC CATCGCCAAG GGCCTGACTT CCGGCTACAT CCCGATGGGT 

301 GGTCTGATCG TGCGCGATTC GGTGGTCGAA GTGCTGAACG AAGGCGGCGA 

3S1 TTTCAACCAC GGATTCACCT ACTCCGGTCA CCCGGTAGCG GCGGCTGTTG 

401 CCCTGGAAAA CATCCGCATC ATGCGCGAAG AGAAGATTAT CGAGCGCGTC 

4 51 CAGGAAGAAA CGGCACCGTA TTTGCAAAAG CGTCTGCGTG AACTCAACGA 

501 TCATCCATTG GTGGGTGAAG TTCGCGGGGT AGGGTTGCTG GGCGCTATCG 

551 AACTGGTTCA GG AC AAAGCC ACGCGCAAAC GTTACGAAGG CAAGGGCGTG 

601 GGCATGATCT GCCGGCAGTT CTGCTTCGAC AACGGGCTGA TCATGCGCGC 

651 GGTTGGCGAC ACCATGATCA TCGCGCCGCC ACTGGTGATT ACCAAGGCGG 

701 AAATCGATGA GCTGGTCAGC AAGGCACGCA AGTGCCTGGA CCTGACCCTG 

751 AGTGTGTTGC AGGGCTAAGT GCTAGGCTCT GAGCGGGAGT TGTATGAACT 

801 TTCGCTCAGA GCGGTCAGAA AGCTTGGCCT TTCCTTGAAA GACCGCCATG 

851 GATGTTGCCA GACTAGCCAC CGTTCCAAAT GCCCGGGTTC GGCGCGGAAC 

901 AGGTGGTTCA AAAAAGCAAA AATTTGGAGC ATTACGCATG AAGGCACTCG 

951 GTAAAAAGCT CGCCGGCAAG ACACTCCTTG CCATGTCCCT GATGGGCATC 

1001 ATGGCGGGCG CGGTTCAGGC AGATGACAAA GTCTTGCACG TGTACAACTG 

1051 GTCCGATTAC ATCGCGCCGG ACACCATCAA GAAGTTTGAA GACGAGTCGG 

1101 GCATCAAGGT GGTCTACGAC GTCTTCGACA GTAACGAAAC CCTCGAAGCC 

11S1 AAGTTGCTGG CCGGC AAGTC CGGTTACGAC ATCGTGGTGC CTTCGAACAA 

1201 CTTCCTGGCC AAGCAGATCA AGGCCGGCGT CTACCAGAAG CTGGACAAGT 

12S1 CCAAGCTGCC GAACTGGAAC AACCTGAACA CCGATCTGCT CAAGGCCGTT 

1J0 1 TCCGTCAGCG ACCCTCCTAA CGACCACCCC TTCCCGTACA TGTGCCGCTf 
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1351 GATCGGCATC GGCTTCAACG 

14 01 ATGCACCGAC CAATTCCTGG 

14 51 AAGCTGAAAT CCTGTGGCAT 

1S01 TCCGGTGGCC CTGCACTACC 

1551 AACAACTGGC CGAGGCCGAG 

1601 GGTTACTTCC ACTCCTCCAA 

1651 CTGCGTGGCG ATCGGCTACT 

1701 CCGCCGAAGC CGGTGACAAG 

1751 GGTGCAGGCA GCTTCTACGA 

1801 CGTCGAAGGC GCCTACAAGT 



CCGAGAAGGT CAAGGCCGCG CTGGGTCCGC 
GACCTGATCT TCAAACCGGA AAACGCCGCC 
CAGCGTGCTG GATTCGCCAA CCGAGATGAT 
TGGGCTACCC GACCGACAGC CAGGACAAGA 
GCACTGTTCC TCAAAGTTCG TCCTTCGATC 
GTACATTTCC GACCTGGCCA ACGGCAACAT 
CGGGTGACAT CTATCAGGCC AAGACTCGCG 
GTCAAGGTCA GCTACAACAT TCCCAAAGAA 
CATGGTCGCC ATCCCTAAAG ATGCCGAAAA 
TCATGACCTT CCTGCAG 
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Nucleotide sequence pFI91 1 f850bp EcoRI/PstI fragment ex pFI973) 

1 CTGCAGACCT TCTGCCAGGC GCACCGGCTC ACGCAGGGTT TTGACTTCCT 
51 GGATCATCAG GCGCTGGTTG CGTTTGACCG ACTGGCCAAT CATCAGGCGA 
101 AACGCATTGG AGCCCAGGTC GATAGCGGCG AATAGCGATG CGTCTTCTTT 
151 CACGTGAGGA ACTCCTGGCA ACTTCGTCCG CCGAGGGCAA AAAACCGGTT 

201 TTGCCGATCC TGCACGGGGT AGATGACATC AGGATGACAT TGG AAATTTT 

251 TCTGACAGAC GTTTCGTCAC CAGAACGTCA CAGTCGCGGG GCTAGCATCG 

3 01 GGGCTTCCAA TCGGGTCGGG AGCCTTGAAC ATGCTGTTAA CCAACGACAC 

351 CCTGATGCAT CGCATCCACC GCGAGTTGCT CGACCACAGT GACGAAGAGC 

401 TGGAACTGGA GTTGCTGGAA GACGATCACG ACCTGGCTTC GCTGTTCGCG 

451 GATCAACCGG GCGATACCCC GGCCAAGGCC GAGCGCCGTC GTTACTTCAG 

501 CGAGTTGTTC CGTTTGCAGG GCGAGTTGGT CAAGTTGCAA AGCTGGGTGG 

551 TGAAGACCGG GCACAAGGTG GTGATTCTGT TCGAAGGCCG CGATGCCGCT 

601 GGCAAAGGGG GCGTGATCAA GCGCATCACC CAGCGTCTTA ATCCACGGGT 

651 CTGCCGGGTC GCGGCGCTTC CCGCGCCGAG TGACCGCGAG CGCACCCAGT 

7 01 GGTATTTCCA GCGTTATGTC TCGCACCTGC CCGCCGCCGG CGAGATCGTC 

751 CTGTTCGACC GCAGCTGGTA CAACCGCGCC GGTGTCGAGC AGGTGATGGG 

801 CTTTTGCAAC GAGGAACAGT ACGAAGAATT TTTCCGCAGC GTGCCGGAAT 

851 TC 



Figure 14 
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Nucleotide sequence pFI912 f958bp EcoRI/PstI fragment ex pR793^ 

1 CTGCAGGGCC TGGGGCATGC CGAGTCGGCG TCGCAGAACG CCTCTGCCTA 

SI TGCGCTGGAA CGCAAGCAAA TGCGTGCGCC CGCTCGCCCG GTCGGAGTCG 

101 AAGCCGAAGT GGCCGACCCC ATTCATTTTC ATCCGGCCAT GCGCCGGGTG 

151 TTGCTGGAAC TGAGGGCCTA TGCCGAGGGC ATGCGTGCGG TCGGTTACTG 

201 GGCGGCGCAT TTGTTGGATC AGTCCGAGCA GGCCGAGGAT CTGCCCACTC 

251 GTCAGCGCGC CTTGCAACTG GCGGAGCTGC TGACGCCGGT GATCAAGGCG 

3 01 TTCTTCACCG AGCAGGGTTT TCGCCTGGCC AGCAACGCCT TGCAGGTGTT 

351 CGGTGGCTAC GGCTACGTCA GCGAGTTCGC CATCGAACAG ACCCTGCGCC 

401 ACAGCCGGAT CGCGATGATT TACGAGGGCA GCAACGAAAT CCAGGCCAAT 

451 GACCTGCTGC TGCGCAAAGT GCTGGGGGAT GAAGGTCGCG CCTTTGGCCA 

501 ACTGTTGGCG GTCATGCGCG AAGAGGCCGA ACTGGCCTGT AACGACACCC 

551 GCTTTGGCGC TGAGCTGGTG CAGCTGTGCG ACAAACTCGA GACAGTGCAA 

601 CTTGAGATAG GGGACCTCGC CGTCAGGGAG CGCGAATACC CGTATCGAGC 

651 CGCTGGCGAT TTCCTGCGCC TGTGTGGCGT GGCGCTGTTG GGGTTTTCCT 

701 GGGCGAGAGC GGC ACGGGTG TCTCGCCTGT TACCTGACAG CGATCCACTG 

751 CGTCCCAACA AACTGG AAAC CGCGCGTTTC TTCTTTGCCT ACCTGCTGCC 

801 AGAAGCCGAT CAACGCCTCG CAGCCATTCG GGCGGCGAGA GCGCCGTTGC 

851 CGTTTTTGAT CTGAAAAAAC GCCCGCCAGG CCCAATGTGG CTCGCTCCCA 

901 CAAACAGCGC GAACCACATC GAGCCACCGC CGCCACGCCA GTTGTACAGG 

9 51 CCGAATTC 



Figure IS 



BNSDOCIO: <WO 9735999A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 97/35999 



PCT/GB97/00809 



21/27 

Nucleotide sequence pFI913 f959bp EcoRI/PstI fragment ex pFI793) 

t CTGC AGGCTT GCCATATCAG TGGCGACAGC TTCGTCCGCG CTCCAGGCGC 

51 AAGGGCCAAC GACCTGCCGA CGGTGCCGAA TAGGCTGTCG GCGTCCGTTA 

101 TTCTGGACGC ACCGC AAAAA CTGTTATTTA CCCGGTCTTC TTCCACTGTA 

151 GAACCTTTTC ACT AT AGCGG CCCTGCGTGT TCTGCGGGAG CTGCTCATGA 

201 TTCTGCACGC GATTCCACTT CCAGCCCGTT GCCGCGCCGT GCTGTTGCGG 

251 TTTCTGCACG CACGGCTTTT GCATCAGGCT TGCACAGCCA GCCACAAGGG 

301 CAGGTAAGCT CTAGCTCGCA CGTCCTGGGC GTCTCCCAGG TCTGCCAACG 

351 CGACGCGGAC GCGTCAAACA ACGCCCGGCC CCTAATGAAG CCGGG AC ACT 

4 01 CAGCCCAGAG GCATTTATGA GTAACAACCT CGACCAGCTC ACCGATTGGT 

451 TGAAAGACCA CAAGATCACA GAAGTCGAAT GCATGATTGG CGACTTGACC 

501 GGGATCACCC GCGGCAAGAT CTCGCCAACC AACAAGTTCA TTGCCGAAAA 

551 AGGCATGCGC CTGCCCGAGA GTGTGCTGTT GCAGACAGTG ACGGGCGACT 

601 ATGTCGAAGA CGACATCTAT TACGAACTGC TCGACCCGGC CGACATCGAC 

651 ATGATCTGCC GCCCCGACCA GAACGCGGTG TTCCTCGTGC CATGGGCCAT 

701 CGAGCCGACC GCGCAGGTGA TTCACGACAC CTACGACAAG CAGGGCAACC 

751 CGATCGAGCT GTCGCCACGC AACGTCCTCA AGAAAGTCCT CAAACTCTAT 

801 TCCGACAAGG GCTGGCAGCC GATCGTGGCG CCGGAAATGG AGTTCT ACCT 

851 GACCAAGCGC AGTGACGACC CGGATTACCC ATTGCAACCG CCGGTTGGCC 

901 GTTCCGGACG TCCGGAAATC GGTCGCCAAT CGTTCTCTAT CG AAGCGGCC 

9 51 AACGAATTC 
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Nucleotide sequence merged contigs pFI913/PCR product/pR901(4259bp) 

1 CTGCAGGCTT GCCATATCAG TGGCGACAGC TTCGTCCGCG CTCCAGGCGC 

51 AAGGGCCAAC GACCTGCCGA CGGTGCCGAA TAGGCTGTCG GCGTCCGTTA 

10 1 TTCTGGACGC ACCGCAAAAA CTGTTATTTA CCCGGTCTTC TTCCACTGTA 

151 GAACCTTTTC ACTATAGCGG CCCTGCGTGT TCTGCGGGAG CTGCTCATGA 

201 TTCTGCACGC GATTCCACTT CCAGCCCGTT GCCGCGCCGT GCTGTTGCGG 

251 TTTCTGCACG CACGGCTTTT GCATCAGGCT TGCACAGCCA GCCACAAGGG 

301 CAGGTAAGCT CTAGCTCGCA CGTCCTGGGC GTCTCCCAGG TCTGCCAACG 

3 51 CGACGCGGAC GCGTCAAACA ACGCCCGGCC CCTAATGAAG CCGGGACAGT 

401 CAGCCCAGAG GCATTTATGA GTAACAACCT CGACCAGCTC ACCGATTGGT 

451 TGAAAGACCA CAAGATCACA GAAGTCGAAT GCATGATTGG CG ACTTGACC 

501 GGGATCACCC GCGGCAAGAT CTCGCCAACC AACAAGTTCA TTGCCG AAAA 

551 AGGCATGCGC CTGCCCGAGA GTGTGCTGTT GCAGACAGTG ACGGGCGACT 

601 ATGTCGAAGA CGACATCTAT TACGAACTGC TCGACCCGGC CGACATCGAC 

651 ATGATCTGCC GCCCCGACCA GAACGCGGTG TTCCTCGTGC CATGGGCCAT 

701 CGAGCCGACC GCGCAGGTGA TTCACGACAC CTACGACAAG CAGGGCAACC 

7 51 CGATCGAGCT GTCGCCACGC AACGTCCTCA AGAAAGTCCT CAAACTCTAT 

801 TCCGACAAGG GCTGGCAGCC GATCGTGGCG CCGGAAATGG AGTTCTACCT 

851 GACCAAGCGC AGTGACGACC CGGATTACCC ATTGCAACCG CCGGTTGGCC 

901 GTTCCGGACG TCCGGAAATC GGTCGCCAAT CGTTCTCTAT CGAAGCGGCC 

951 AACGAATTCG ACCCGCTGTT CGAAGACGTC TACGACTGGT GCGAACTGCA 

1001 GGAGCTGGAT CTCGATACGC TGATCCACGA AGACGGCACG GCGCAGATGG 

1051 AAATCAACTT CCGTCACGGC GACGCGCTGT CCCTGGCCGA CCAGATCCTG 

1101 GTGTTCAAGC GCACCATGCG CGAGGCCGCG CTCAAGCACA ACGTGGCCGC 

1151 CACGTTCATG GCCAAGCCGA TGACCGGCGA GCCTGGCAGC GCCATGCACC 

12 01 TGCACCAGAG CATCATCGAT ATCGAGACCG GCAAGAACGT CTTCTCCAAT 
125 1 GAAGACGGGA GCATGAGCCA CTTGTTCCTC AACCACATCG GCGGCCTGCA 

13 0) CAAATTCATC CCTGAACTGC TCCCGCTCTT CCCCCCCAAr GTCAACTCGT 
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1351 TCCGCCGCTT CCTGCCGGAC ACTTCGGCGC CGGTGAACGT CGAGTGGGGC 

1401 GAAGAAAACC GTACCGTGGG CCTGCGGGTG CCGGATGCCG GCCCTCAAAA 

14 51 CCGTCGGGTG GAAAACCGCC TGCCGGGTGC CGACGCCAAC CCGTACCTCC 

1501 CGATTGCCGC GAGCCTGCTG TGCGGCTACA TCGGCATGGT CGAAGGTATC 

1551 AACCCAAGCG CGCCTGTGGT GGGTCGTGGT TACGAGCGGC GCAACCTGCG 

1601 TCTGCCGCTG ACCATCGAAG ACGCTCTGGA ACGCATGGAA AACAGCAAGA 

1651 CCATCGAGAA ATACCTGGGT CACAACTTCA TCACTGGCTA CGTCGCGGTC 

1701 AAGCGGGCCG AGCATGAAAA CTTCAAGCGC GTGATCAGCT CATGGGAACG 

17 51 GGAATTCCTG TTGTTCGCCG TCTGACACGC CGGGTGCGGC CCTCAAAAGC 

1801 CGCACTCCAA CCTCACTAGG AGAGCTTTAT GAGCAACAAC CCGCAAACCC 

1851 GTGAATGGCA GAACCTGAGC GCCGAACACC ACCTGGCCCC CTTCAGTGAC 

1901 TTCAAGCAAT TGAAGGAAAA AGGCCCGCGC GTCATCACCA GCGCCAAGGG 

1951 CGTTTACCTG TGGGACAGCG AAGGCAATCA GATCCTCGAC GGCATGGCCG 

2 001 GCCTGTGGTG CGTGGCCATC GGTTACGGCC GCGACGAGTT GGCCGAGGCT 

2051 GCCAGCAAGC AGATGCGCGA GTTGCCGTAC TACAACCTGT TTTTCCAGAC 

2101 CGCTCACCCG CCCGTCCTCG AGCTGGCCAA GGCAATTTCC GATATCGCGC 

2151 CAGCAGGCAT GAACCACGTG TTCTTCACCG GTTCCGGCTC CGAAGGCAAT 

22 01 GACACCATGC TGCGCATGGT TCGCCACTAC TGGGCGATCA AAGGTCAGCC 

22 51 AAACAAGAAA GTCATTATCA GCCGCAAGAA CGGCTACCAC GGTTCGACCG 

2 3 01 TGGCCGGCGC CAGCCTGGGC GGCATGACCT ACATGCACGA ACAGGGCGAC 

2 3 51 TTGCCGATCC CGGGCATCGT GCACATTCCG CAGCCGTACT GGTTCGGTGA 

24 01 AGGCGGCGAC ATGACCCCGG AAGAATTCGG GATCTGGGCT GCCAACCAGT 

24 51 TGGAAGAAAA GATTCTCGAA GTCGGTGTCG ACAACGTCGG CGCCTTCATT 

2501 GCCGAGCCGA TCCAGGGCGC CGGCGGCGTG ATCGTGCCGC CAGAAAGCTA 

2 5 51 CTGGCCGCGC ATCAAGGAAA TCCTCGCCAA GTACGACATC CTGTTCGTCG 

2 601 CCGATGAAGT GATTTGCGGT TTCGGCCGTA CCGGCGAGTG GTTCGGCAGC 

2 651 GATTTCTACG ACCTCAAGCC CGACATGATG ACCATCGCCA AGGCCCTGAC 

27 01 TTCCGGCTAC ATCCCGATGG GTGGTCTGAT CGTGCGCG AT TCCGTGCTCG 

■V/bl AAGTGCTGAA CGAAGGCCGC GATTTCAACC ACGGATTCAC CTACTCCCGT 

Figure 19 (Part 2 of 4) 
SUBSTITUTE SHEET (RULE 26) 

BNSDOCID <WO 9735999A2J_> 



WO 97/35999 — PCT/GB97/00809 

26/27 

2 801 CACCCGGTAG CGGGGGCTGT TCCCCTGGAA AACATCCGCA TCATGCGCGA 

2 851 AG AG AAG ATT ATCGAGCGCG TCCAGGAAGA AACGGCACCG TATTTGCAAA 

2 901 AGCGTCTGCG TGAACTCAAC GATCATCCAT TGGTGGGTGA AGTTCGCGGG 

2 951 GTAGGGTTGC TGGGCGCTAT CGAACTGGTT CAGGACAAAG CCACGCGCAA 

3 001 ACGTTACGAA GGCAAGGGCG TGGGCATGAT CTGCCGGCAG TTCTGCTTCG 
3051 ACAACGGGCT GATCATGCGC GCGGTTGGCG ACACCATGAT CATCGCGCCG 
3101 CCACTGGTGA TTACCAAGGC GGAAATCGAT GAGCTGGTGA GCAAGGCACG 
3151 CAAGTGCCTG GACCTGACCC TGAGTGTGTT GCAGGGCTAA GTGCTAGGCT 
3201 CTGAGCGGGA GTTGTATGAA CTTTCGCTCA GAGCGGTCAG AAAGCTTGGC 
3 251 CTTTCCTTCA AAGACCGCCA TGGATGTTGC CAGACTAGCC ACCGTTCCAA 
3 3 01 ATGCCCGGGT TCGGCGCGGA ACAGGTGGTT CAAAAAAGCA AAAATTTGGA 
3 351 GCATTACGCA TGAAGGCACT CGGTAAAAAG CTCGCCGGCA AGACACTCCT 
3401 TGCCATGTCC CTGATGGGCA TCATGGCGGG CGCGGTTCAG GCAGATGACA 
34 51 AAGTCTTGCA CGTGTACAAC TGGTCCGATT ACATCGCGCC GGACACCATC 
3501 AAGAAGTTTG AAGACGAGTC GGGCATCAAG GTGGTCTACG ACGTCTTCGA 
3 551 CAGTAACGAA ACCCTCGAAG CCAAGTTGCT GGCCGGCAAG TCCGGTTACG 
3 601 ACATCGTGGT GCCTTCGAAC AACTTCCTGG CC AAGCAGAT CAAGGCCGGC 
3 651 GTCTACCAGA AGCTGGACAA GTCCAAGCTG CCGAACTGGA AGAACCTGAA 
3701 CACCGATCTG CTCAAGGCCG TTTCGGTCAG CGACCCTGGT AACGAGCACG 
37 51 CCTTCCCGTA CATGTGGGGC TCGATCGGCA TCGGCTTCAA CGCCGAGAAG 
3801 GTCAAGGCCG CGCTGGGTCC GGATGCACCG ACCAATTCCT GGGACCTGAT 

3 851 CTTCAAACCG GAAAACGCCG CCAAGCTGAA ATCCTGTGGC ATCAGCGTGC 
3901 TGGATTCGCC AACCGAGATG ATTCCGGTGG CCCTGCACTA CCTGGGCTAC 
3951 CCGACCGACA GCCAGGACAA GAAACAACTG GCCGAGGCCG AGGCACTCTT 

4 001 CCTCAAAGTT CGTCCTTCGA TCGGTTACTT CCACTCCTCC AAGTACATTT 
4 051 CCGACCTGGC CAACGGCAAC ATCTGCGTGG CGATCGGCTA CTCGGGTGAC 
4 10 1 ATCTATCAGG CCAAGACTCG CGCCGCCGAA GCCGGTGACA AGGTCAAGGT 
4 15 1 CAGCTACAAC ATTCCCAAAG AAGGTGCAGG CAGCTTCTAC GACATGGTCG 
4 20 1 CCATCCCTAA A( I ATf ;f*PGAA AACOTCGAAG GCGCCTACAA GTTCATGACC 
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4 2 51 TTCCTGCAG 
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